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Preface 


In this book the behavior of electrons in a crystal is described in 
terms of the one-electron model. Clearly, such a system would be 
best approached by a many-body formalism but because of mathe- 
matical complexities only a very few real problems can be solved in 
this way. In the one-electron model, each electron is considered 
separately as moving in the averaged field of the nuclei and all the 
other electrons. The resulting picture is simpler conceptually, much 
more tractable mathematically and, at the same time, is quite 
comprehensive, which makes the one-electron model the most popu- 
lar approach currently in use in solid state physics. 

The concept of pseudopotential that has been developed within 
the one-electron model over the past 15 to 20 years appears at first 
sight artificial and inconsistent. It is stated that the interaction 
potential between an atom and an electron, with its notorious 
Coulomb singularity at the nucleus, may be replaced with a non- 
singular potential (or “pseudopotential”) without the electron so much 
as noticing the change. What is more, quite a number of such pseudo- 
potentials are possible for a given atom, each leading to essentially 
the same overall picture. This seems rather strange because the 
properties of crystals are uniquely determined by the element’s 
position in the periodic table, that is to say, by the charge of the 
nucleus that is “left out” in the pseudopotential method. 

Naturally enough, many of today’s materials scientists are skep- 
tical of the pseudopotential method and consider its predictions 
unreliable or at least difficult to interpret. Some physicists avoid 
the use of the method in their work and some use the technique with- 
out properly understanding it (which is not an error-proof attitude). 
It would be desirable therefore to have available an up-to-date 
solid state theory formulated in terms of “pseudism”. 

The pseudopotential method was first put forward in the frame- 
work of the nearly-free-electron model and it is in these terms that 
it is mostly discussed in textbooks. However, the method extends 
far beyond one particular model, indeed it provides insights into 
virtually every aspect of the behavior of electrons in a crystal. 
Today, a knowledge of pseudism is mandatory for anyone interested 
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in a thorough understanding of the properties of solids and their 
dependence on external conditions and composition. 

Although conceptually the pseudopotential method is closely 
related to and, indeed, originates from scattering theory (to show 
this is, more or less, the raison d’etre of this book), up to now there 
has been no other way to see this than by searching for scraps of 
relevant information through innumerable journal papers. This is 
another example of what often happens in science: a theory has been 
worked out and is already well in use before theorists manage to 
systematize the material into a consistent form. Surprising as it 
may seem, we have not even been able to find a definition for pseu- 
dism. In our view, the following one would be comprehensive: 
Pseudism is a system of ideas stemming from the fact that a short- 
range “compensatory” term may be added to the electron-ion interac- 
tion potential which will model the combined action of the nucleus 
and the core electrons while leaving unchanged the long-range part of 
the potential. The resulting potential is called a pseudopotential. 
This interpretation of pseudism is not only valid for model or augment- 
ed-plane-wave pseudopotentials but is also suitable for the pseudo- 
potentials in scattering theory and for those involved in band-theory 
secular equations. 

Even such a fundamental phenomenon as the periodicity of chemi- 
cal elements can be interpreted in terms of pseudism. As the number 
of atomic electrons increases, the outer uncompleted shell of the 
atom eventually fills to become what is called a core shell. It may 
then be dropped from consideration. This means that as the atomic 
number (and, accordingly, the number of valence electrons) in- 
creases, progressively higher shells will be neglected in this manner 
and the structure of the atom—in terms of its interaction with an 
external electron—will indeed show periodicity. 

Even though we have given the techniques of the one-electron 
theory a good deal of attention, our primary concern was to provide 
the reader with an understanding of the underlying physics. Since 
we did not intend the book to be a literature review, we usually start 
each new topic with references to the main works on the subject 
(both original papers and, if need be, reviews) and in the subsequent 
discussion we only mention the papers we considered the most in- 
teresting. We hope that in doing so we have been more or less objec- 
tive, though admittedly this was not always an easy task. 

We found it appropriate to illustrate some applications of the 
one-electron theory. One example is the relative stability of different 
crystal structures of metals and alloys, a significant part of the more 
general and, from the practical point of view, important problem of 
phase transitions. For those about to study solid state physics, the 
stability problem is certainly a promising start. All the relevant 
material presented in Chapter 1 in analytical form, will be found in 
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Chapter 2 in its “numerical” interpretation. Since crystal stability 
is still an area of research, the reader is thus invited to see—from 
inside, as it were—a solid state problem still in its evolution. 

In presenting the material, we have tried to show the logic the 
theory followed as it was developed and, to some extent at least, 
to show the “scaffolding” used to build it. Whether or not we have 
succeeded is for the reader to judge. 

The concept of pseudopotential, as we believe the reader will 
become convinced, is a direct consequence of quantum-mechanical 
scattering theory and we hope therefore that the book will be of 
interest not only for solid state specialists but also for a reader in- 
volved in other “quantum” activities. 

The intended reader need have no more background in quantum 
mechanics and atomic structure than is provided by, say, Shiff’s 
Quantum Mechanics. Scattering theory and some other necessary 
information will be found in the book itself. An elementary knowledge 
of solid state physics is, of course, desirable, although not necessary. 

It should be noted that, as is characteristic of science in general, 
the literature in this field abounds in jargon. The term “exchange”, 
for example, is often used instead of the more correct “exchange 
potential” which, in turn, is an abbreviation for “the potential of 
the exchange interaction”; “potential” often replaces “pseudopoten- 
tial” or is sometimes supposed to mean “the potential formfactor”. 
Whenever appropriate, we have tried to accustom the reader to the 
jargon, though not of course at the expense of the clarity and cohesion 
of the material. The same is true of equations: in journal papers 
indices are frequently omitted and arguments or variables are not 
meticulously specified. Although a self-evident simplification for a 
specialist, this is often a stumbling block for a beginner. 

The book should be read more than once, though some places 
can perhaps be skipped over. Some of the topics were chosen be- 
cause they are likely to be important in the future development of the 
one-electron model (these topics include scattering outside iso- 
energetic surfaces, the concept of effective medium, and higher- 
order perturbation corrections). Some problems are left, unobtrusive- 
ly, for the reader to solve. For example, will a self-consistent cal- 
culation yield wider or narrower energy bands than those resulting 
from the non-self-consistent treatment? To answer this question, the 
reader will have to master the material in Chapter 3. Our hope is 
that in the course of the book the reader will himself pose problems 
like this and we foresee the satisfaction he (or she) will take in solv- 
ing them. 

We hope that this book will be helpful for both theorists and ex- 
perimentalists. 

The Introduction and Part 1 were written by L.I. Yastrebov (who 
would like to use an opportunity to express his gratitude to his co- 
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author for enlightening discussions about pseudopotential secular 
equations (Sec. 5.2)). Part 2 was written by A.A. Katsnelson. 

Members of Solid State Physics Department of the Physics Faculty 
at Moscow State University were of great help in preparing the manu- 
script, as were the members of the Metals Science Department at the 
Moscow Aviation Technology Institute. We gratefully acknowledge 
their continuous support and valuable comments. 

Since the first (Russian) edition of this book was published in 
1981, evidence has built up to support the theoretical principles 
presented therein and to strengthen our confidence in their usefulness 
for solid state physics. In the first edition our discussion of the ap- 
plications of the theory was limited to the crystal stability of metals 
and alloys. Since a great deal of work has been done in other areas 
in recent years along the same principles, we believed it appropriate, 
when the English translation was being prepared, to illustrate the 
theory with more examples of its computational value. 


Introduction 


Chapter 1 


Principles 
of the one-electron theory 


This is the introductory chapter and it is written therefore in 
a very concise manner. For more details see, for example, [1-5]. 


1.1.1. The assumptions of the theory. In principle, we should be 
able to find out everything about a (stationary) crystal by solving 
Schrédinger’s equation for a system of interacting nuclei and elec- 
trons that form the crystal. This problem, however, is too hard to 
solve and some simplifications are therefore unavoidable. 

First, it is generally assumed that the nuclei are too massive to 
follow the rapidly changing spatial distribution of the electrons 
(the adiabatic approximation). For this reason we actually consider 
two Schrödinger equations, one for the electrons and the other for 
the nuclei. In the following we shall only be concerned with the 
behavior of electrons. 

It is further assumed that instead of treating the electronic sub- 
system as a whole, it is possible to consider separately the motions 
of individual electrons. Each of them is then thought of as moving 
in the effective field of the (stationary) nuclei and all the other elec- 
trons (hence the name—the effective field approximation). According- 
ly, the total electron wave function is expressed in terms of individ- 
ual electron wave functions, ¥;. 

Further approximations are made in the actual evaluation of the 
total function and in the consequent determination of the effective 
potential as seen by a single electron. It is customary to use Slater’s 
determinant which automatically incorporates the Pauli exclusion 
principle (6, 7]. This leads to a system of one-electron Schrodinger 
equations in the Hartree-Fock approximation; the crystal potential 
includes both Coulomb and exchange interactions between the 
electrons. The theory discussed in Secs. 2.2-5 and 3. 1 is valid, in 
fact, for any effective potential, that is, in order to develop a general 
form of the one-electron theory of crystals, the first two approxi- 
mations are quite sufficient. 


1.1.2. Bloch’s theorem. The potential acting on an electron in 
the crystal must possess the periodicity of the crystal: 


T (t + ty) = 7 (r), (1.1) 
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where t, is a translation vector (the radius-vector of the vth lattice 
site). It can be proved that, under the translation t,, a one-electron 
wave function ¥ obeys Bloch’s theorem: 


P (r + ty) = etkty Y (r). (1.2) 


Introducing ux (r) = WV (r) exp (—ikt,), it is easily seen that 
uy (r + t,) = ux (r), so that ¥ may be written in the form 


Wy, (r) = uy (r) et, (1.3) 


where ug (r) has the periodicity of the crystal lattice. The wave func- 
tion of an electron in a crystal is thus a plane wave modulated by a 
periodic factor. 

The vector k is known as a wave vector; for electrons in a vacuum, 
E = h*k?/2m, for electrons in a crystal the k-dependence of E is 
more complicated. Its actual form (the dispersion law) may be cal- 
culated by using the band theory of crystals. 

In the following, the crystal potential 7” is represented as the sum 
of the potentials V (r — t,), which are centered at the sites of the 
crystal lattice. This allows us to write a Schrodinger equation for 
each such (single-site) potential. * For the information about the 
crystal to be retained the wave function of this single-site Schrodinger 
equation must obey Bloch’s theorem (1.2). 

Bloch’s theorem (1.2) may therefore be considered as boundary 
conditions for a single-site Schrodinger equation. 


1.1.3. Reciprocal space. The wave vector k has a dimension of 
reciprocal length and is therefore defined not in the coordinate space 
of the crystal but rather in reciprocal space. 

In reciprocal space we can set up a reciprocal lattice: its sites will 
be specified by radius-vectors g,. The vectors t, and g, are related 
by the important relationship: 


tun =2n fyn ’ 


where fyn's are integers. It is known that in real space each site 
can be surrounded by a polyhedron with the property that all points 
within it are nearer to the given site than to any other one. Such 
a construction is called the Wigner-Seitz cell. Its analog in reciprocal 
space is also known as the first Brillouin zone. Calculations of dis- 
persion laws are carried out in band theory only for the wave vectors 
inside this zone. 

If a face of a Brillouin zone is the perpendicular bisector of a vector 
g, (of which there are an infinite number), we may write an obvious 


* Throughout the book, the script letters 7° and Y° denote the potentials 
and pseudopotentials, respectively, that result from the superposition of the 
single-site potentials, V, and pseudopotentials, W. 
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relationship for any vector k ending at the face: 
[k | = |k —g, |. (1.4) 


A crystal under study is usually assumed to be a large one, with 
the wave functions having the same values at opposite faces. This 
means that the vector k can only take on fixed values, but these are 
so closely spaced that, if needed, a summation over the allowed k 
values may be replaced by a d®k-integration. 


1.1.4. Measurement units. Planck’s constant is taken to be unity. 
The Bohr radius is adopted as the length unit, viz. ag = 0.529177 A, 
the unit of mass is twice the mass of an electron m, and the energy 
unit is the rydberg, the ionization potential for the hydrogen atom: 
1 Ry = 13.6 eV. We thus have in atomic units: 

h = 1, h/(me*?)=a,=1, me? = 1, 
m = 1/2, è =2, 1 Ry = e?/(2a,) = 1. 

The unit of length in reciprocal space is 2x/a (a being the lattice 
constant of the crystal); k = 2n/a (z, y, 2), with z S y S z; and 
fn = 2n/a (A, B, C), where A, B, C are integers. In this system of 
units, Schrédinger’s equation for an electron takes the form: 


(H — E) Yy (vr) = (—V2+ 7" (r)— E) Vy (r) =0. (4.5) 


1.1.5. General approach to a band problem. For all existing methods 
of solving Schrédinger’s equation (1.5), the underlying principle 
is virtually the same. First, a set of trial (initial) functions @; is 
chosen. The unknown function VY, is then presented as an expansion 


in Ọ; in terms of unknown coefficients B}: 
Wu (r) =>) BED, (r). (1.6) 


Substituting (1.6) in (1.5), multiplying on the left by f and in- 
tegrating over all space, we arrive at a system of equations for BY: 
2) (Hj; —ES,;) BF =0, (1.7) 
where 
H y= | OF (x) AO, (x) d'r, 
(1.8) 
Su = | OF (x) D; (r) dr. 
The system of linear homogeneous equations (1.7) has nonvanishing 
solutions if (and only if) its determinant vanishes: 
det | Hy; — ES ;; |=0. (1.9) 


14 Introduction 


Equation (1.9) is called the secular equation, and the determinant 
in (1.9) the secular determinant. The Æ values that satisfy (1.9) 
are the quantities E (k) that we require. Having obtained them, we 
solve (1.7) for the BY and, atthe next step, get Yy (r) by using (1.6). 

A secular equation may be very complicated in practice, but the 
expansion of Y, in terms of trial functions is a common feature for 
all band theory methods. Sometimes it is convenient to take as trial 
functions the solutions of a certain auxiliary problem: 


AD; =e,0,, (1.10) 


where B, is the “initial” Hamiltonian and e;—the “initial” dispersion 
law. Introducing the difference 
A=H—H,, (1.14) 
we obtain the equation 
det | (e; — E) ôji + A; |=0, (1.12) 


in which the orthonormality of the functions P; has been taken 
into account. 
When Eq. (1.6) has only two significant terms, Eq. (1.12) can be 
solved analytically, i.e. 
E =E E, (1.13) 
where 


E=E+ E, E= (1/2) (£a +£), E4 = (1/2) (Aaa + Abb), 


n 1.14 
E, = (412) VeL Bea e, Brn) F 4| Aas Ë (1.14) 


To see the meaning of this result, consider a system with two 
quantum levels, €a and e,. Switching on the interaction A between 


Eb 





Fig. 1.1. Old levels ¢, and €, are hybridized into new levels E+ and E_. The 
E, level is introduced to emphasize the inadequacy of the old quantum numbers. 


them gives rise to an “intermediate” twofold degenerate level Ey. 
The latter then shifts in energy by the amount E, (upwards or down- 
wards) and splits into a pair of new levels, E and E_, separated from 


the level E = E, + E, by +£,, as shown in Fig. 1.1. 
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It can be seen from Eqs. (1.13) and (1.14) that it is impossible to 
label the new levels with the old quantum numbers (a and b), since 
hybridization has taken place and new “numbers” have emerged 
(denoted by + and —). 

If Aap (=Aj,) is set to zero, there is no level hybridization and we 
are simply left with two displaced levels, eg + Ag, and gp + App. 


1.1.6. Linear combination of atomic orbitals (LCAO). Let us apply 
the above formalism to the formation of a crystal by bringing to- 
gether N atoms, each of which has M levels described by the quantum 
numbers « (rather, these are sets of quantum numbers). The quantum 
number i in the expansion (1.6), in fact, includes not only the number 
a but also the index of each atom v, which makes the dimensionality 
of the secular equation (1.9) enormously large. To avoid this the 
Bloch ‘boundary conditions (1.2) are incorporated into the trial 
functions themselves which are written for this purpose as (Bloch’s) 
linear combination of atomic orbitals @, (r), viz. 


Dy (k, =e È eM vg, (r— ty). (1.45) 


It then turns out that the matrix elements in the secular equation 
(4.7) are labeled by atomic quantum numbers while the summation 
over the crystal sites has already been performed within the equation 
which greatly reduces its size: 


= Del e wf på (1) Ho, (r—t,) dèr = Hga (k). (4.16) 


In the tight-binding limit when the electron interacts, in effect, 
with only one atom (the central one) the hybridization of the initial 
dispersion laws (atomic levels) occurs even if the functions of neigh- 
boring atoms are orthogonal. The point is that the initial atomic levels 
are affected by the matrix elements of the central atom’s potential, 
taken between the functions of the central atom and its neighbors. 
A change in the boundary conditions imposed on the wave function 
acts as a “perturbation potential” and the requirement that it vanish 
at infinity is replaced by condition (1.2), which is easily seen to be 
satisfied by LCAO, Eq. (1.15). 

When the functions of the central and neighboring atoms do not 
overlap (if, for example, the lattice constant goes to infinity), Eq. 
(1.12) automatically reduces to the equation for the energy levels 
of a free atom, viz. 


det | (eg — E) 8g, | = 0. (1.17) 
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1.1.7. Nearly-free-electron model (NFE). In this model a crystal is 
viewed as a spatial lattice of ions embedded in an electron gas. 
While in LCAO the perturbation in the eigenvalue problem is the 
departure of the true potential from the atomic potential, in an NFE 
model it is the departure from zero. For a zero potential (in, say, 
a vacuum) an electron wave function is a plane wave normalized 
to all space. In a crystal, the normalization integral may be con- 
veniently presented as the sum of contributions from all the lattice 
sites. Since these contributions are all the same, they can be taken 
out of the sum and what remains is simply the total number of sites, 
N. If we introduce the volume of the Wigner-Seitz cell, Q = 2/N, 
where Q is the volume of the crystal, we find that the plane waves 
may just as well be normalized to the Wigner-Seitz cell. 

The function u, (r), which has the periodicity of the crystal, may 
be expanded in Fourier series over the reciprocal lattice vectors 
g, [8]. As a result, the function Wy (r), Eq. (1.3), then takes the 
form of an expansion in terms of the plane waves |k + g,): 





1 ilk+g,)r 
Wy (r) = v5 Sd) By kt en), (1.18) 
“on 


The secular equation (1.12) in the NFE model may be written as 


det | (¢n — E) Ôn + (k + gw |7| k+g,)1=0, (1.19) 
where e, = (k+ gn)? and 


(k |F |k) HA È e7 ikt 7 e (r) eitt dêr (1.20) 


is the matrix element of the crystal potential. 

The crystal potential can always be written as the sum of the 
contributions from all the lattice sites. Since these contributions 
(or single-site potentials V (r)) are all equal, the corresponding 
integrals are independent of the site index and we have a general 
expression: 


KFE o= y D | V e t) etk gsr 


=S (k—k’) (k’ | V |k) = 8S (q) V (a). (1.21) 


Here we have denoted q = k’ — k and introduced the structure 
factor S (q) and the formfactor of the single-site potential V (q) = 
{kk +q|V|k). Note that, unlike plane wave normalization, the 
integration in the formfactor is over all space rather than over a 


Ch. 1. Principles of one-electron theory 17 


Wigner-Seitz cell alone: 


1s) ani 
S(q)=— De (1.22) 
(k+q|V | k) =- f e-tk+arV (r) ettr d3r. (1.23) 
° over all 
space 


The structure factor S (q) is defined for any vector q. For a periodic 
lattice, S (q) is zero unless q happens to coincide with any of the 
vectors of the reciprocal lattice, i.e. S (q = gn) = 1. 

The potential formfactor (1. 23) is also defined for all q’s. If the 
potential operator V is merely a multiplication operator, the form- 
factor depends solely on q and is called local. In the following we 
shall introduce more complicated effective potentials (pseudopoten- 
tials), with formfactors dependent both on q and on k. These form- 
factors and potentials are called nonlocal. 

Let us now consider a few simple examples. Let V (r) be a screened 
Coulomb attraction potential: 

Ve" -- _ Ze 
; 





enCr. (1.24) 


To calculate its formfactor, we direct the polar axis along the vector q 
[8]. We find 
AnZe 1 

w g+c? ° 


In the limit when C vanishes, (1.24) is replaced by the familiar 
(nonscreened) Coulomb potential Vc, and (1.25) is replaced by its 
formfactor: 


Vi" (q) = (1.25) 


Ze? 4nZe? 4 
Ve(r)=———, Vel) =- F 


Comparing Vo and Ve" shows that the screening of the Coulomb 
potential removes the singularity of the formfactor at small q while 
leaving its behavior at large qg (V (q) œ q~*) unchanged. This is 
typical of most formfactors, as will be shown in Secs. 4.1-3. 

Let the potential V (r) be vanishingly small. In this case the crystal 
is what may be called an empty lattice: electrons move within an 
imaginary crystalline lattice with no potentials on its sites. 

‘The secular equation (1.19) for the empty lattice model is very 
similar to Eq. (1.17), which was obtained for the LCAO model in the 
limit of large interatomic separations: 


det | (€n — E) San |= 0. (1.27) 
The empty lattice model is equivalent to a free electron gas subject 


to periodic boundary conditions. The band structure for the model 
is discussed in [9-11]. 


2—01063 








(1.26) 
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1.1.8. Perturbation theory. The secular equation (1.19) may be 
approximated by a perturbation theory expansion. To see this, we 
employ a version of Léwdin’s method of “folding” secular equations 
[12-14]. 

Let us write a matrix X in block notation: 


A B 
X= P p) . (1.28) 


Next we introduce the unit matrix J and matrices Y and Z: 


y ( I n) D °) 
=\_ pict}? 4>\o p+}: 


It can be seen that det Y = det Z = 1 (this is proved by decomposing 
the determinants in terms of their minors, starting from the upper 
left-hand corner). Now consider the matrix product P = XYZ, 
Clearly, det P = det X. We obtain 


det X = det D-det (A — BD =C). (1.29) 


We have thus succeeded in expressing the determinant of a large- 
dimension matrix X in terms of the determinants of small-dimension 
matrices. 

We now can use Léwdin’s method to rewrite Eq. (1.19) as a single 
line. Let the diagonal perturbation matrix element <k |V |k) be 
first order relative to k?, (k -+ gn | V |k) second order in /*, and 
(kk + gnr |V Ik +g.) third order in 24°, though for the moment 
we shall ignore third-order terms. Block A in (1.28) only contains 
one element, k? — E + (k| V |k); blocks B and C are Hermitian- 
conjugate with matrix elements (k +g, | V |k); and block D 
is diagonal. When det D is zero, Eq. (1.19) is satisfied on account 
of (1.29), but its solutions are of no physical interest since they 
only describe high-lying empty lattice dispersion laws. As a result, 
we get from Eqs. (1.19) and (1.29): 


E(k)=k?+(k|V|k)— >} ee ee eee (...) (4.30) 
n=Æ0 


where (. . .) indicates higher-order terms arising from the nondiago- 
nality of D. 

We have thus obtained the Brillouin-Wigner form of the perturba- 
tion theory expansion [15]. Now we shall take £ in the denominator 
of (1.30) as the dispersion law given by the first-order perturbation 
theory (E = k? + (k | V | k>). Taking into account that, by (1.20), 
the local potential formfactor (k + g, | V |k + g,) is independent 
of n, we arrive at the perturbation theory expansion in the Rayleigh- 
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Schrödinger form [45]: 


E(k) =k Vik J Ht ELSE., (31y 
næÆ0 


Even though less exact, Eq. (1.34) is extremely valuable in that 
it allows analytical expression for dispersion laws. In fact, it is 
the basis for the present-day theory of the atomic properties of solids. 
We shall see later that this expansion is used for calculating the total 
energy of a crystal. 

There are two types of sums in Eq. (1.31): infinite sums over the 
vectors of the reciprocal lattice, that form mth order contributions 
and the infinite sum of these contributions, usually restricted to the 
second- or third-order perturbation terms. One should be confident 
that the sums of both types are convergent. 

Consider a sum over g,,. In view of Eq. (1.25), each term of the 
sum is proportional to g;° at large g,. The density of points in re- 
ciprocal space increases as gh, that is, the g,-sum does converge, 
each term giving a finite contribution. 

The convergence of (1.81) as a whole requires much more careful 
analysis and in Sec.2.5 we take up this subject in more detail. For 
the moment we restrict ourselves to a generally accepted statement: 
a perturbation expansion diverges if the perturbing potential pro- 
duces at least one bound state. 

This is a point of extreme importance. It is stated in fact that 
whether or not the continuum spectrum (E > 0) may be treated by 
perturbation theory depends on the presence or absence of bound 
states (which are only possible at Æ < 0). The existence of bound 
states is known to depend on the depth of the potential. If the poten-. 
tial is too strong (what exactly this means is determined by some- 
criteria), the mth order contribution equals or exceeds that of the: 
(m — 1)th order and the perturbation expansion fails to converge. 

Every atomic potential has bound states. Hence, to make per-- 
turbation theory applicable, a new and rather peculiar potential 
has to be constructed. It has to be weaker than the original one, but. 
it must still affect the continuum electrons in exactly the same way. 
The contribution of these potentials is discussed inf Secs. 2.3-4, 
4.1-4, 5.2, 


1.1.9. The Fermi free-electron gas. In this model the crystal for-- 
mation process is as follows: N neutral isolated atoms are each stripped 
of Z outer (valence) electrons. The resulting ions are then brought. 
together to form a crystal lattice, with a volume of Q% per ion. An 
electron gas is then allowed into the lattice, its spatialf distribution 
being assumed uniform, so that its density is Z/Q,. 


ae 
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If we ignore the fact that the lattice potential is not constant, the 
electron dispersion law is simply Æ = k?. The energy levels occupied 
by the electrons range from zero to a maximum, which is known as 
the Fermi level of a free electron gas: 


= (312Z/Q,)2/3 = ke. (1.32) 


It was shown in [8] that a dispersion Jaw is not altered if electron- 
electron Coulomb interactions are allowed. Including exchange 
interactions, we obtain [8]: 


—h Er (4 1— y? 1--y 
E(k)=k—etE (1 = In| 4], (1.33) 


where y = k/ky. The oxchange interaction strongly affects the states 
within the Fermi sphere. At k = kp the gradient of E (kp) goes to 
infinity, which is physical nonsense, although this singularity dis- 
appears in more elaborate theories. In Secs. 3.1,3,4 the exchange 
interaction is discussed in more detail. For the moment, note that 
the exchange correction term in Eq. (1.33) (corresponding to the 
(k| V |k) term of (1.31) is at a maximum at k = O and decreases 
with increasing k. 





1.1.10. Total crystal energy in the NFE model. To calculate the 
total energy of a crystal, the dispersion laws are summed over all 
the occupied states. This is performed by integrating (1.31) over 
all the wave vectors within the Fermi sphere. As a result, the total 
crystal energy, Xer, takes the form 


Zer = U9 + Uns + Uiats (1.34) 


where Ulat takes account of the Coulomb repulsion between the 
ion cores; U, and Ups are respectively the volume- and structure- 
‘dependent electronic contributions to the total energy: 


U=2 -m | (K| Y |k) dk, (1.35) 

inside the 

Fermi 

sphere 

k+-q{V 1k) [2 
Uns = 2 oar a | S(q) | { eee dip pint 

"saad the 
Fermi 
sphere 


(1.36) 


where the g, are replaced, for generality, by the ordinary position 
vectors q. 

The U, term describes a “free” electron gas with the inclusion of 
-electron-electron interaction; Uy, is associated with band characteris- 
tics and is therefore called the band-structure energy (or simply band 
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energy). The term Ue)" takes care of the fact that when calculating 
E (k) the interaction energy of each electron pair contributes to the 
energy of each electron, that is, in summing the one-electron energies 
in (1.34), the electron-electron interaction energy is counted twice. 

The calculation of U, and Uat may be found in many textbooks 
(see [16-18] and [19]). lt is instructive to discuss Ups- 

If a potential V is local, its formfactor is k-independent and can be- 
taken out of the k-integral. The k-integral that appears then in 
Eq. (1.36) is known as Linchard’s function [3]: 





Q 1 Bh 
i) = —2 GFW kgk: (1:37) 
inside the 
Fermi 
sphere 


By assuming a spherical Fermi surface, one gets 


-1 — r? = 
n= — 2 (ġa (im EN), 1a 
where x = k/(2k;). There is a similarity between Eq. (1.38) and the 
exchange correction term to the dispersion law (1.33), the only differ- 
ence being that in y (q) the singularity occurs at k = 2k, instead 
of k = kp as in Eq. (1.38). 

The electron-electron interaction energy Utt may be represented 
[17] as a result of interactions between the nonuniform part of the: 
crystal’s electron density and the potential which is the difference 
between the crystal potential 7°" and the initial potential #71 
of the system of ions. By formally setting up a function by which 
to characterize this potential, a (q) = Fi (q)/7 © (q), we can 
express Ue} in terms of the formfactor of 7 (q), Eq. (4.21): 











Ua = 2 | S(q) 121 V (q) |?%(q) (14 —@ (q)), (1.39): 
Uns =à | S (q) 12 |V (a) I? x(a) —Uel' = 2 |S (q)|2 | V (a)l? x (a)a (a). 
(1.40) 


Note that, in deriving this equation, we made no assumptions 
about the nature of the potential (except for its locality) or about 
the way in which the crystal potential is composed of the ionic 
potentials (that is, about the screening mechanism). y (q) is a function 
arising from perturbation theory whereas œ (q) is the ratio of the 
original potential of the ions to the crystal potential. In Sec. 3.2 
we shall encounter dielectric screening, in which case Lindhard’s. 
function y% (q) will also be needed and œ will be interpreted as the 
system’s “response” to a perturbation. This, however, will only be 
a matter of elaborating the technique: the meaning of Eqs. (1.34)- 
(4.40) will remain intact. 
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A frequently used characteristic function Pps (q) is defined as 
Drs (a) = | V (q) P x (a) & (q). (1.41) 


‘Using Eq. (1.41), Ups can be rewritten as a sum, over the lattice 
sites, of Fourier transforms Pps (t) which are defined as 





Q . 
Dos (t)=2 Gaz | Oy, (a) e'a dg. (1.42) 


Pys (t) may be thought of as the potential of the interaction 
between ions through the electron gas (the same electron is simultane- 
‘ously attracted by all the ions, hence their mutual attraction). It 
is this interaction that must compensate for the direct Coulomb 
repulsion of the ions. The total potential for interatomic interaction 
is 


(Z 
D(t) = +D (t). (1.43) 
Note that @ (t) describes the interaction between atoms given 


that the system’s volume is constant. In other words, ® (t) is the 
gedistribution of the atoms. 


Part 1 


Theoretical principles 
of the pseudopotential method 


Chapter 2 


Scattering theory 
for “solid-state people” 


2.1. Mathematical formalism 


The subject of this chapter is quantum mechanical scattering 
theory. Although the concepts of the theory are widely used within 
the one-electron approach, the relevant mathematics can nowhere 
be found in a compact form and often many books, sometimes very 
obscure ones, have to be consulted before a needed formula is located. 
This makes things difficult for someone interested in band-theory 
computational methods and, especially, in how the methods are 
related to each other. We have therefore attempted to compile all 
the necessary information, avoiding whenever possible asking the 
reader to take anything “on trust”. 

This chapter may thus be considered as an introduction to the 
modern formalism of the one-electron approximation. 


2.1.1. 8-function. This frequently used function has the following 
property: for a continuous function f (x) 


| F) 8 (Kx) de = f(x). (2.1) 
the integral extending over all space. 


2.1.2. Green’s function method (GF). This is a powerful tool for 
treating differential equations which can be described briefly as 
follows [20]. Consider the differential equation 


(=V? — E) yẹ (r) = f (r). (2.2) 
We chose Schrédinger’s equation for the sake of concreteness but the 
GF method in fact applies to any equation. 


Green’s function G (r, r’) is defined as the solution of (2.2) whose 
right-hand side is replaced by a source function 


(—V? — E) G (r, r) = —6 (r — r’). (2.3) 
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In terms of Green’s function, the solution of (2.2) is given by 


paspa | Gir) sar’, (2.4) 
over all 
space 


where A is an arbitrary constant and @ satisfies equation (2.2) for 
a zero right-hand side, 


(=V? — E) ọ (r) = 0. (2.5) 


To prove equation (2.4), we apply the operator (—V? — £). Since 
Vy? acts on the coordinate r, it may be introduced into the integrand. 
The first term in (2.4) is, by (2.5), zero. We use Eqs. (2.3) and (2.1) 
for the integrand, and this completes the proof. 

The integral equation (2.4) is more convenient to deal with than 
a differential equation since the boundary conditions needed for 
calculating wp are already included in ọ (r) and G (r, 1’). 


2.1.3. Spectral expansion of Green’s function. Let ¢, be the eigen- 
values of Eq. (2.5), and @, the corresponding eigenfunctions. Since, 
by definition, GF obeys the same boundary conditions that @, (r) 
does, it can be expanded in terms of these functions (which form 
a complete set for GF): 


G(r, r) =>) anPn (r). (2.6) 
Insertion of (2.6) in (2.3) leads to 
È an (En — E) Pn (r) = — ô (r— r’). (2.7) 


Multiplying this equation by gy (r) and integrating over r we 
find after using the orthogonality of ọọ, that 


an (2.8) 
em r= — 3 sae (2.9) 


In deriving (2.9) we have used the fact that the left-hand side of 
(2.3) is a self-conjugate operator. 

Expression (2.9) allows us to determine GF. It follows from (2.9) 
that GF is Hermitian: 


G (r, r’) = G* (r’, r). 


Combining Eqs. (2.7) and (2.8), we arrive at an important result 
characteristic of all complete sets of functions, i.e. 


2 Ph (1) OF (r') = 6 (r— r’). (2.10) 
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2.1.4. Free-electron Green’s function. In the special case of free 
electrons the eigenfunctions of equation (2.5) are plane waves: 


-$ eikr 

V Gap | 
| OR eee (2.11) 
Go (r, r')= — -aF \ awe PR, 


Pk = 


where x = + V E. This integral is known to be [20, 21] 
4 eixir -r| 


Go (T, r)=— a rT: (2.12) 
It can be easily seen that at r = r’ GF has a singularity. Conse- 


quently, the derivative dG/dr is discontinuous at r =r’: 


(3 25) +0. 


ðr ðr 





Tr’ 


2.1.5. Spherical harmonics. Coming back to Eq. (2.2), let us set 
f (rt) = —V (rt) » (r) where the potential V (r) is spherically symmet- 
rical. It is then convenient, in Eqs. (2.2) and (2.5), to change to polar 
coordinates: 


x=rsin8-cosg, y=rsin®8-sing, z= r cosð; 
0<0<n, OXQ<2n. 


The operator V? is the sum of two operators: 
1 2 
V? = Vit => Vote (2.13) 


where the operator V; only acts on the absolute value of the vector r- 

and V, only acts on the angles between r and the coordinate axes. 
We shall need the explicit form of V?: 

1 2 ô . 

Vir Gp reo, (2.14): 

The eigenfunctions of this operator, Y ım (9, @), are called spherical 


harmonics: 
— V6, oYim(9, P) =L (L+ 1) Yim (8, 9), (2.15) 
BY im (8s P) =N mP” (cos 8) ee, (2.16): 


where V,,, is a normalization factor and PY"! is the associated 
Legendre polynomial of degree | m |, which is dependent solely om 
the angle 0. It can be seen from (2.15) that the solutions Y,,, are- 
degenerate with respect to the index m. 


126 Part 1. Pseudopotential methods, Theoretical principles 


The spherical harmonics obey the addition theorem: 


' 21 +1 
S) Vin (1) Yim (1) = EE P, (cos Oy, r), (2.17) 





m 


where Ym (r) is a harmonic dependent on the angles between the 
vector r and the coordinate axes, and P, is a Legendre polynomial 
‘dependent on the angle between r and r’: 


P=" = P, — P,(c08 8) leos ant = 1. 
The spherical harmonics are orthonormal: 


| Yin (1) Yrm (8) dQr Sir8mm" (2.18) 


‘where the integration is over the angles between r and the coordinate 
‘axes, and 6p,’ is the Kronecker delta: 


1 if p=p’, 
orp = | 0 if , 
Dp . 
The harmonics are thus “normalized to unity”; the normalization 
factor Njm includes a factor of 1/V 4m, so that, as a special case, 
‘we have 
1 


TE (2.19) 


Y 1=0,m=0 = 
There exists the addition theorem for harmonics with different 
J, m and l’, m’: 


Yr (r) Yu (r) = > ChrY ty (r), (2.20) 


where the quantities Cir are known as Gaunt's coefficients. They 
are, in fact, the matrix elements of Y, between Yj, and Yç, i.e. 
LILI) 


Chr = | Yi (0) Y; (1) Y r (1) dO. (2.21) 


Here and often in the following we use L as a shorthand for {l, m}. 
The spherical harmonics Y, are complex. We may also define 
ireal harmonics [22]: 
Y Yı,- - Yim—Y1,- 
Yim =, ig = (2.22) 
‘These are also called “cubic harmonics” since, under symmetry 
<operations, they transform as cubic symmetry eigenfunctions. 
The l = Q, 1, 2, 3, 4 harmonics are usually referred to as the 
-s, p, d, f and g functions, respectively. In many textbooks angular 
-dependences for s, p and d functions are plotted in Cartesian coordi- 
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nates. These are, of course, the cubic harmonics because the spherical 
ones have imaginary components and cannot therefore be plotted. 

Formally, the cubic harmonics can be labeled* by certain p indices 
instead of m indices; the cubic harmonics will then be ortho- 
normal with respect to both m and p. 

The functions used in band-structure applications (see for example, 
Sec. 5.1 for a discussion of GF structure constants) are usually the 
cubic harmonics, since matrix elements between them are convenient 
in being real. This is, in fact, achieved by simply changing notation, 
and so in what follows we only use spherical harmonics because they 
make the Hermitian property more transparent. 

Spherical (cubic) harmonics, since they are the eigenfunctions of 
the operator Vé,9, form a complete set of functions on sphere and 
therefore any angle-dependent function can be expanded in terms of 
Y, (r): 


f(r) => fa (lr I) Y; (r). (2.23) 


2.1.6. Radial equation. Let us consider Eq. (2.2) with —V (r) 4p 
on the right. By (2.23), its solution p can be expanded in terms of Yz: 


p (r) =>} Rim (1 r|) Y; (r). (2.24) 


Substituting (2.24) in (2.2), changing over to spherical coordinates, 
and using Eqs. (2.14) and (2.15) we obtain a differential equation 
for Rim: 


ZZ Aml) +[V (r) +2] Bin) = 0. (2.25) 


Obviously, .#%;,, is independent of m and this index may therefore 
be dropped. Since Eq. (2.25) is linear, the function 2, will be defined 
in terms of a multiplicative constant independent of r which will be 
determined by the boundary conditions (e.g., by normalization). 
Equation (2.24) will therefore be replaced by the more correct expres- 
sion 


p (r) => BLA (r) Y, (r) (2.26) 
with 2, satisfying Eq. (2.23). 


2.1.7. Bessel, Neumann and Hankel spherical functions. Equation 
{2.20) always has two linearly independent solutions, one regular 


* It is clear from (2.22) that cubic harmonics cannot be labeled by the old 
indices m. This “prohibition” results from the mixing of quantum numbers which 
is caused by the same kind of perturbation we saw in LCAO and NFE models. 
ìn this particular case the “perturbation” is due to the change in symmetry. 
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nd the other irregular at the origin. For a zero potential, the 
well known and are called, respectively, the Bessel and the Neumans 
spherical functions, jı (x) and n; (z), where x = xr (see [20, 24}) 
For the first few / their explicit forms are: . 





, sin z cosST 
jal) =—— ’ No (x)= ——— , 
sin z CoS T coSs T SIn 
ji (1) = z2 zx? Ny (t) = — a Zz? (2.27) 
. 3 3 
jo(a) = (3 —+) sine—3 coss, nm (2)=— (=e — ) cos 


In the small-x limit the asymptotic values are: 


zl O AAN | 
hl) = wpa? O) Sap an (2.28) 
where the / in (2/ + 1)!! is a positive integer or zero. These formulas 
provide a good approximation to j; for z? < 41 +- 6 and to n, for 
z < 2 [21]. 
At large z the asymptotic values are [23]: 


_, sin (z —In/2) 


ji (x) = 7 > h (x) —— __ 08 (x —11/2) , (2.29) 


X> x 

These formulas are good for z > 1 (l + 1)/2. The oscillation am- 
plitude is already within 10% of the asymptotic value for z > 24 
[23]. 

Both functions obey the recurrence relations for j; 





jus (0) = — a [ay (2), 
jala) tii (=E ja) a>) 


and similarly for n, 
The Wronskian of the functions j; and n; is nonzero: 


ha) $14 (2) — m (2) E hlar. (2.30) 


The functions j; and n, are related to the ordinary (cylindrical) 
Bessel and Neumann functions, J, (x) and Jı (x), by 


h= Y Earle, m= E Iae 
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It is often convenient to work with a complex combination of j, 
and nı which is known as the Hankel function: 


yë = f(x) + in (2). (2.31) 
For large x its asymptotic value is 
vi _ > A gt i(x—Ia/2) 


X00 T 


The functions yj and y7 correspond to the incoming and outgoing 


waves, respectively. For a more detailed discussion of j; and n; 
see [21, 24]. 


2.1.8. Useful formulas. A good example to illustrate the use of 
Eqs. (2.23) and (2.26) is the plane-wave expansion in Y, (r) 
1 1. 4 els * 
yg = TF > iji (kr) Yi (k) Y, (1) = Dy hz (k, r) Y, (1), 
="0 " 0 
L L (2.32) 


where Q, is the normalization volume and 








hy (k, r)= E ilj, (kr) Y$ (k). (2.33) 
A comparison of Eqs. (2.32) and (2.26) shows that, for a plane 
wave, the coefficient Bz, in (2.26) is 4mi'Y{ (k) and it can by no 
means be omitted. 
By using Eq. (2.32) we obtain a re-expansion of the spherical 
functions relative to another origin of coordinates: 


ji(x|r—t I) ¥n(r—t)= 2 Pryryjv (4r) Yi (r), 


. * 2.34 
Pir => Chil (xt) Yi, (t). ( ) 
1 
The following formula [25, 26] is more difficult to derive 
n(x |r—t])Y,(r—t) => Kivu (xr) Yr (1), 
(2.35) 


Kir = > CE nu (xt) Y$ (t). 
1 


Equation (2.35) is valid for |r |< | t }. 
.< If cubic harmonics are used, Eqs. (2.34) and (2.35) yield an anal- 
ogous result for the yf functions. 
Another useful formula for GF (2.12) is 
4 inir—r’| , , . M , 
Se oT = Di fi(er<) vi (xr YE (r) Yz (r), (2.36) 
L 
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where r< (r>) is the smaller (larger) of the lengthsr and r’. (It is 
due to these restrictions on r and r’ that a GF remains finite at r 
when r’ tends to zero.) 

A Green function can be divided into two parts, one regular at 


r =r’ (GF) and the other singular (GĦ): 


Gye = — nee). Im Gy, 

as 
gine. 1 cose n=") _ Reg (2.37) 
7 4n [r—r’ | = oe 


The function Gy® satisfies Eq. (2.5) and G)"® satisfies Eq. (2.3). 
If the full Gy is replaced by Gò, Eq. (2.4) is still a solution to 
(2.2). In this case, known as the standing wave representation (note 


that this is not an approximation), the solution of Eq. (2.4) is real. 
With the help of (2.32) we find that 


py (21+4) ji(x)=1. (2.38) 


It is also noteworthy that, as follows from (2.10), the spherical 
harmonics form a complete set: 


> YEMYi(")=6 (r). (2.39) 


1 
L=0 


2.1.9. Projection operators. The concepts of orthogonality and 
completeness are closely related to projection operators which are 
used in pseudopotential theory. For an orthonormal set of functions 
{a), we can write an operator that “projects” a function f (r) onto 
the space of the | a) functions: 


P=DP., Pa=laria| (2.40) 
in the sense that 


Pip=d Palp=% | a) (a | f). 


Here the right-hand side is clearly recognized as an expansion of 
f in terms of the | a) functions, that is the projection operator is 
simply an expansion operator over a basis set. 

A simple example is the operator that singles out the Lth compo- 
nent of f(r), i.e. 


P =] L4}, 
P, |D= DL, D=YL M) | YEW) AQ (2.41) 
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By applying (2.41) to the plane wave (2.32) we get 


P, |k)= h; (k, r) Y, (0). 


It can easily be proved that any projection operator is idempotent, 
i.e. PP =P. 


2.2. Scattering on an isolated potential 


2.2.1. Phase shifts, analysis in terms of Green’s functions. We can 
now apply* the GF method to scattering on an isolated spherically 
symmetrical potential V (r). We employ the standing wave represen-- 
tation (2.37): 


Ging — 2a © jil(xre)nlxrs)Y LŒ) Y, (r). (2.42) 


Substituting (2.42) into (2.4) and taking into account that, for 
free electrons, k? = x? = E, we obtain using (2.32) 


pH È hi (% Vi (r) 
L 
+42 È ji (rc) my (ers) V(r) P(e) Yr (r) Br’ Vy (r) 


We now change to spherical coordinates, use (2.26) and integrate: 
over the angles between the vectors r and r’. We choose B; to be- 
of the form 





An 
B,=b — iY, (x). 
L L V2 i( ) 
This allows us to eliminate the dependence on the direction of x. 
in final formulas. As a result, 


R= [E+ j (#74) V (ry) Airy) r? dra} ji (xr) 


r 


{u | Jaera) V ry) Ai (r) r? dry} m (ur) 
0 


It is an easy matter to obtain the same expression in the running“ 
wave representation (which only means replacing n; by iyt). 





* The same approach is used in Sec. 5.1 to treat energy spectra in crystals. 
This leads to the secular equation of the Korringa-Kohn-Rostoker (KKR) 
method. 
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Let us define the functions: 
Sı (r)= — x \ ji(ur,) V (r4) Ai (E, ri) ri dry, (2.43) 
0 


A, ¢ . 
Ci(r)=F-+% \ n (xr) V (ry). 21 (E, ry) ri ary. (2.44) 


Note that S, (0) = 0, C,(co) = A/b,. Using Eqs. (2.43) and 
(2.44), we find 


Ayr) = Cı (r) ji (xr) — S: (r) n (xr). (2.45) 
The presence of a potential has thus resulted in mixing the regular 
and irregular solutions of the homogeneous equation. 
In the limiting case of small r 
C,(r — 0) = const, (2.46) 
Aılr —> 0) = const? -jı (xr). (2.47) 
For a potential that decreases rapidly enough (say, dropping to 


zero at r > R), the functions C; and S, at large r are independent 
-of r. Let us introduce 


const? = V C}-+ S? 


-and define an angle n; (E): 


nı (E) = arc tan [S, (r — 00)/C,(r > o)]. (2.48) 

Eq. (2.48) is equivalent to 
S,(r —> oo)=const;-sin Ny, (2.49) 
C,(r —- œ) = const? -cos nH, (2.50) 


Ry (E, r — oœ) = constr (cos jf, (xr)— sin ngen (xr)). (2.54) 


The angle 7; is called the scattering phase or the phase shift. To 
-see where the name comes from, note that by using (2.29) we obtain 
for r—> œ 


œ sin (xr— In/2+ nd) (2.52) 


Gf. — 
Alr —> oo) = const; ur 


It can then be seen that the solution (2.52) is, in a way, shifted 
‘by an angle y; relative to the j, (xr) (2.29). Note, however, that 
there is no phase shift in the denominator of (2.52), which means 
that this interpretation can only be made with caution. 
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The magnitude of the phase shift can be found from 


tan y (E) =——S1@ rE 8 S (E, r= 00). 
const?— x į nı (xr) V (r) Rı (r) r? dr 
Ò 
(2.53) 
One more useful relation is 
const? -cos n; = A/by. (2.54) 


2.2.2. Normalization conditions for the wave functious. It is a 
usual textbook statement that the normalization of a wave function 
is arbitrary in scattering problems and it is only relative quantities 
which matter, such as reflectivity or transmissivity. This is undoubt- 
edly true. However, it is the particular choice of normalization that 
specifies the form of the wave function of relevance. Moreover, as 
the following examples show, the choice of normalization determines 
the way in which the phase shift is expressed in terms of .%). 

(A) Plane-wave normalization: 


b, = A. (2.55) 
Then, by (2.53), 
tan q= —x | ji (oer) V (r) Ar(r) rar, (2.56) 
0 
2 (r) =5, (er) — tan n (E)n, (ur). (2.57) 
(B) Normalization at the origin: 
consti = 1, (2.58) 


oo 


tan yn, = — x j jV Rir? dr [(1—x | nV Rir? dr) » (2.59) 
0 0 


The radial function 2, is given by (2.51) with the phases determined 
by (2.59). Note that it is this normalization which is adopted in 
books on the phase function method [27, 28] (see Sec. 2.3). Expression 
(2.59) is identical, for example, with Eq. (6.18) of [28] (note, though, 
that #, is replaced in [28] by u; = #,/r). 

(C) Normalization at infinity: 


const; = 1, (2.60) 
sin qn = —x \ iV.% dr, (2.61) 
0 
‘Ry = COS p+ ji (xr) — sin yer (xr). (2.62) 
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This normalization is used when the behavior of electrons is stud- 
ied far away from a scatterer at the origin (e.g., an impurity atom, 
see [29]). 

(D) Scattering-oriented normalization: 


bp = —-— “4 \ ny (ur) V (r). Pı (r) 2 dr. (2.63) 
0 


S1(r—- œ) 


Then we have 
tan N = —x \ n, (ur) V (r) Ry (r)r?dr, 
0 


and finally 


A 
const] = — — tan n, const? = const? -cos n; — tan ng, 
L 


A, = (const + tan yy) (ji (xr) —tan(m-ri(ur)). CS9 

If the amplitude of the function is known for both small and large 
distances, we can evaluate the phase shift. Conversely, from the 
large-distance amplitude and shift we can determine the function 
at the origin and ignore its intermediate behavior. This is, in fact, 
a manifestation of the principle underlying the concept of the pseudo- 
potential: the phase shift contains a “memory” of the interaction 
times nx, where n is an integer. Note that for the asymptotic value 
(2.52) the phase is needed to within 2nm, which is exactly the accu- 
racy provided by Eq. (2.64). 


2.2.3. Logarithmic derivatives of the radial functions. We thus 
know that far away from a force center (or at r > R, for a potential 
of (finite) range R) the wave function .%, is given by (2.51). The 
expressions we derived for y; are in an integral form and for this 
reason are not very practical. There are a large number of methods 
(both variational [30-32] and not [33-37]) with which to calculate 
the phase shifts. The one that follows is the most commonly used. 

If no boundary conditions are imposed, Eq. (2.25) can be solved 
numerically for any value of E. We can therefore integrate* 
Eq. (2.25) to a certain r (which may, for example, be the range R} 
and then demand that X, join smoothly to (2.51): 


lim R, (r= R — ô) = const? (cos yr: jı («R)— sin ni- n (x R)), 
6>0 


lim 34 (r = R — ô) = const? (cos niji (xR) — sin neni (xR). ©”? 
6>0 


* Any differential equation can be numerically integrated without bounda- 
ry conditions being specified. Initial conditions are, of course, necessary. 
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Introducing the logarithmic derivative of the radial wave function, 


n (E) = |E ay a PE) y (2.66) 


we obtain for the phase shift 


ji (%R)—jı (XR) (E) 


tan N: (E)= ni ( (xR)—nı (xR). -M ( E) ’ 


(2.67) 


where the primes denote differentiation with respect to r. 

Let us discuss the & dependence of the function 4;. The function 
i, (E) has zeroes at energies Ef and it has singularities at energies 
€, determined by* 


4 9, (r, E) |-r=0, (2.68) 
Ai(r, £) |rar=0. (2.69) 


To see whether A; is a decreasing or increasing function of E, 
we write out (2.20) for energy £,, multiply it by .#, (E)r? and 
integrate over r from 0 to R. Next we take (2.25) for energy E,, 
roultiply.it by .#7, (E) r” and integrate over r. Subtracting the second 
integral from the first, we get 
[r2 (R, (E2) Ai (Ey) — Bi (E1) Ai (E2)? 

R 


= (Ey— Ep) | Ay (E) #1 (Ba) rar, 


0 
where use has been made of Eq. (2.14). Now let Æ, tend to £;: 
E, — E= 2x ôx, &,(E2) = Ri (E) + (0.%,/0x) dx, 


| =n -i a (r, E) |E -f 23 (r, E)r?dr. 
0 


As r tends to zero, #, œ r', 


(2.28); consequently, 


in accordance with Eqs. (2.47) and 


hy (r — 0) œ Ir7, (2.70) 
and we have 


R 
n 1 
ap M (R, E)= pea { Fir, E)redr. (2.74) 
l 0 


Since the right-hand side of (2.71) is negative we conclude that 
à, (E) is a monotonically decreasing function apart from the point 
e€, where it changes from minus to plus infinity (see Fig. 2.1).4 


* Note that these conditions are similar to those used for the bonding and 
antibonding functions in diatomic molecules (cf Chapter 4 in [37]). 
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2.2.4. The behavior of tan y as a function of E. Since 4; may assume 
any value, it should be expected that at certain energies either the 
numerator or denominator in (2.67) will vanish and tan ny, (E) 





AE) 





Ei 





Fig. 2.1. Logarithmic derivative 4, versus energy. (See Fig. 4.7 for actual 
dependence.) 


will have zeroes and singularities similar and related to those of 
à (E). 

The position of a singularity of tan y; (E) (or a resonance energy, 
to use the accepted term) can be found by solving the nonlinear 
equation 


M (E) = nī’ (xR) -Ż m (xB). (2.72) 


In other words, when Ž, is matched to an irregular solution nı 
of the homogeneous equation, the scattering gives rise to a resonance. 

Let us define a hard-sphere potential as being infinite at r < R 
(R being the radius of the sphere) and zero at r > R. (We shall see 
later that this is an “MT-type” potential.) In this case the solution of 
Eq. (2.25) for r< R is zero at any energy: .#; (E, r) = 0, which 
means that, whatever the energy, the logarithmic derivative A; (£, r) 
is infinite. Equation (2.67) then gives the phase B, for hard-sphere 
scattering: 


tan Bı (E) = jı (“R)/n; (xR). (2.73) 


On the other hand, if for an arbitrary scattering potential and an 
energy E = e, Eq. (2.69) is valid, then we have a singularity in A, 
(E = e,) by (2.66). This means, in turn, that at E = g; the potential 
acts as a hard sphere, or 


ni (£1) = By (€). (2.74) 
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It follows that at E = e, the true potential may be replaced by an 
infinite repulsive one, which supports our earlier conclusion that, 
provided a new potential is appropriately chosen, it may be substi- 
tuted for the true potential V (r) without distorting the description 
of the scattering process. 

We have thus discussed the behavior of tan yn; near E = e; With 
regard to its low-energy behavior let us consider the scattering 
problem for a square well of depth V, and radius R. We denote x) = 


VE +V, to obtain the solution of (2.25) as spherical Bessel func- 
tions j; (xR) and 


í (xR)— ji (xR) ji (%oR)/71 (xR) r 
tan ni (E) = eR) mi GR) ji R) R) (2.79) 


To study the low-energy behavior of tan yn; we may use the asymp- 
totic values (2.28). It can be seen from (2.73) and (2.75) that in 
both cases considered above the low-energy behavior of tan y; is 
the same; indeed, it is the same for all finite-range potentials, viz. 

id | 
lim tann (E) E 2. (2.76) 
LE>0 

Next we turn to the behavior of tan yn, near a resonance. Itfis 
tempting to interpret the conditions for “bonding and antibonding”, 
Eqs. (2.68) and (2.69), as arising from the interaction between a 
certain level (of, say, energy E1”) and a perturbation. The pertur- 
bation may result, for example, from a change in boundary con- 
ditions: a previously discrete level appears, “all of a sudden”, above 
the energy zero and therefore becomes quasi-discrete. The electrons 
may then leave this level and spread throughout space or, to put 
it another way, the electron lifetime at the level becomes finite 
instead of being infinite as in the discrete case. For the rest of our 
discussion we shall use the Schrödinger representation [7, 38] in 
which the wave function depends harmonically on time: 


tp (1) = (t= 0) eee, 
To describe the time decay of this quasi-discrete state, its energy 


should be considered a complex variable (in distinction to a discrete 
level): 


Er = E,+ tA). (2.77) 
The quantity A;, known as the level width, is related to the elec- 
tron lifetime q; at the (quasi-discrete) level by 
Tı Å; ~ 1. 


Instead of the width A,, it is customary to use the halfwidth T; = 
A,/2. Assuming the existence of a quasi-discrete level of energy E, 
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and the halfwidth T, we obtain [39]: 


0 n 
tan n; = MTY (E,— E) 


T—tannp Ti (EE) * (2.78) 


From this (see [39], Eq. (134.10)) there follows a relationship for 
the phases: 


T 
m =ni + are tan -pF >» (2.79) 





where n? (E) is the scattering phase for energies far away from Æ., 
Equation (2.78) can be rewritten formally as 


T; 
tan n, = tan nt + -py > (2.80) 


where 
Ti = T; (1 + tan? ni), 


2.81 
E= E, —T, tan nf. ( ) 

We are now in a position to determine the behavior of E, (E), 
taking into account, of course, that for (2.76) to hold at low energies, 
we must have 


+E 
T=, (E)x E 2 (2.82) 


It follows from (2.80) that the energy of a singularity of tan n, 
is not identical to a resonance-level energy. The presence of the 
potential-scattering phase (denoted yf to distinguish it from the reso- 
nance case) renormalizes the resonance energy much in the same way 
as friction affects mechanical resonance. This means that Æi can be 
shifted relative to E, by T, tan n? (or about 0.1 Ry to 0.3 Ry). In 
the theory of the transition metals (see Figs. 6.4, 6.5 and Secs. 2.3, 
5.1) a d band arises from a quasi-discrete level, the energy of which 
is usually thought to be the average energy of the band. This is not 
the case, however: we shall see in Sec. 5.1 that the average energy 
is actually E; 

Equation (2.81) may be regarded as the first-order result obtained 
in the LCAO model: the role of the perturbation is played by the 
deviation of I, from zero and the perturbation itself is caused by a 
change in boundary conditions. The level is shifted by the magnitude 
of the “perturbing” potential. 


2.2.9. The relationship between the energies of the singularities 
of tan q and 4,;. To find this relationship we first write out the energy 
as a function of the phase shift. It follows from (2.79) that 


E = E +T, cot (ni—)). 
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The singularity in 4, implies that Eq. (2.74) is satisfied. This means 
that at HE = €; the phase y; equals §,;. The desired equation thus is 


e,=£,+T, cot (yi — By), (2.83) 


where alljthe quantities are evaluated for the energy e¢;. The closer 
the scattering is to the hard-sphere case, the farther g; is from F;. 
This means that in the energy range under study one of the two 
singularities may be present and the other not. It should be noted 
that the hard-sphere scattering need not always be strong: the phase 
B: is small when j; approaches zero. With d electrons, the asymptotic 
form (2.29) is correct for xR > 3, the zeroes of j, are determined from 
the condition xR — m = nx and, accordingly, the asymptotic value 
(2.29) may be helpful in locating even the first zero. In band cal- 
culations, xR is ordinarily somewhat less than 3 so that the hard- 
sphere d scattering must be weak for energies of about 1 Ry which 
is, typically, the upper energy limit for such calculations. 


2.2.6. Levinson’s theorem. The above formalism is also applicable 
at negative energies because Eq. (2.25) is integrable for any Æ, in- 
cluding & = —x’*, the only difference from the E = +x? case being 
in boundary conditions for 4, at infinity. 

The negative-energy solution of Eq. (2.25) is again a superposition 
of regular (j;) and irregular (n,;) solutions, regarded as functions of 
the imaginary argument ixR. The boundary condition corresponding 
to a bound state is that the normalization integral is finite. This 
implies that as r tends to infinity, #, should go to zero faster than 
r>. The only combination to satisfy this condition is the function 
yi, which represents an outgoing wave (see (2.34)). Comparing (2.34) 
and (2.57) yields the condition for the existence of a bound state: 


tan n, = — i. (2.84) 
Let us set up the function @,=ix/,, where 


hag e sin n= Ze 1). (2.85) 
The function f, is called the partial scattering amplitude and its 
elements between |k) and |k + q) form the ¢-matrix. In the com- 
plex energy plane the scattering amplitude has poles at the energies 
of the bound levels. 
Let us introduce the function 


Y d 
f=- In| 2q,-+1| 


and integrate it over the interval O< x < +œ. As usual, the 
integration contour must enclose all the poles of the function f/f, 
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which coincide with those of fı. The integral will be —2ni times 
the number of singularities of the integrand (alias the number of 


bound states, N) (see [40]). On the other hand, the definition of hr 
and Eq. (2.85) indicate that this integral must be 2i (n; (0) — n: (co)). 
We thus have arrived at Levinson’s theorem [41], namely 


nı (0) — n (00) = tj. (2.86) 


Although, it must be admitted, our derivation is not perfectly 
strict, the theorem has been rigorously proved elsewhere [42-44]. 

The phases involved in (2.86) are the usual phases for positive 
(that is, E = +x?) energies.* 

There are two conclusions to be drawn from the above discussion. 
Firstly, scattering theory can yield the energies of bound states. 
Secondly, Levinson’s theorem reveals that the number of levels at 
E < Q is related to the behavior of the phase at E > 0. lt is this 
remarkable fact that will allow us to introduce the concept of the 
pseudopotential. 

Since it is clear that at high energies V (r) is negligible as compared 
to £ and equation (2.25) therefore becomes homogeneous, the phase 
shift is usually subjected to an additional condition, namely n; (oo) = 
0. In this case Levinson’s theorem states that 


nı (0) = "Ni, (2.87) 
except for the 1 = 0, E, = 0 case, when n; (0) = n (N; + 1/2). 


2.2.7. True and spurious quasi-stationary levels. The quasi-discrete 
(resonant) level becomes a discrete one when the attractive potential 
gets stronger: the E, level goes down and the quantity I’, decreases, 


that is, the energy E appears on the real axis. At E, = O the 
resonance width is also zero and the potential gives rise to a bound 
state, thus affecting the phase (see Fig. 2.2d). 

It can be seen from Fig. 2.2b, c that the phase passes through 7/2 
(at energy £,,) either when increasing or decreasing. But as the 
potential increases still further, the low-energy part of the n, (E) 
curve “bulges out” and Æx moves toward higher energies (see 
Fig. 2.2d, e). The picture then repeats itself, this time shifted up- 
ward by x. The energy £, will thus never fall into the region of bound 
states. This may be seen from Fig. 2.2 where a zero-energy bound 
state is shown: for E, to reach (at least) zero, the phase shifts must 


* We have considered an individual local potentialfor which bound states 
are only possible at negative energies. Levinson’s theorem has not yet been 
proved for nonlocal potentials capable of producing positive-energy bound states. 
Recent attempts [45, 46] to interpret N; in Eq. (2.86) as the total number of 
bound states (at both negative and positive energies) have proved to be incorrect 
[47]. See also [48, 49]. 
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Fig. 2.2. Phase shift ņ; versus energy Æ for potentials of different strengths, The 
potential deepens from (a) to (e). 





vanish at E = 0. It is clear intuitively that this is impossible and 
indeed it has been shown [50, 42] that the magnitude of the derivative 
dy,/dE is bounded from below, i.e. 


—xR <t, = 2dr; (E)/dE L + œ% (2.88) 


The quantity T; is called the electron’s scattering time delay. The 
negative values imply that a scattered electron sort of outruns a 
free one instead of being retarded as before (this may be seen from 
Eq. (2.52)). The lower bound is due to the causality [42]. 

The resonances of tan y, at E, has thus nothing to do with the 
presence of a quasi-discrete level. 

The resonances at E, and E may be distinguished by their width. 
The minimum resonances width Tix is of the order of 1/t7f. It follows 
from (2.88) that IP." ~ (RY E,)- and if the resonance width of 
tan y: is less than Tix , we may be confident that the resonance occurs: 
at a quasi-discrete level. This argument may be helpful in inter- 
preting atomic physics experiments or in a preliminary analysis of 
the band structure of a solid. Figure 2.3, for example, shows the phase 
shifts found in [51] for crystalline Al (the shifts are given in radians; 
the s shift is diminished by 2x and the p shift by x). Note that there- 
are d and f scattering shifts in this case, even though the Al atom. 
contains neither d nor f electrons. 
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The difference between the phase shifts of s and d electrons can be 
clearly seen. It may be argued that the d electrons must be scattered 
by a quasi-discrete level, that is, a d band is described in terms of the 
LCAO method. At the same time 
it is clear that even though thes pi 
phase will pass through 1/2 this 
is actually an x-type resonance, 
so that the LCAO scheme is not Ml yr? 
suitable for describing the s band | 
(because of the lack of the orig- 
inal level) and instead the NFE 
model is needed. A calculation 
of the Fermi energy in Al by 
Eq. (1.32) shows that Eb < E= 2, 
that is, in crystalline Al only s 
and p scattering is significant and 
the Al band structure must be 
calculated by the NFE model. “O 
‘This latter circumstance is con- 
firmed by practice. 

Note that the low-energy phase 
shifts obey the rule: 





M+ r? 


No > N > No > Ya (2.89) 





Fig. 2.3. Phase shifts for Al as de- Fig. 2.4. Formation of a quasi-bound 
rived from a crystal potential [51]. state: (a) the minimum of a free-atom 
potential, (b) the minimum of a short- 
range potential, (c) Friedel oscillations 

on the tail of an MT potential. 


which follows from the asymptotic behavior given by Eq. (2.76). 
Physically, the explanation for this is that the centrifugal potential 
U,=/1 (t+ 1)r% in Eq. (2.25) is a repulsive one: it prevents the 
electron from approaching the origin. As / increases so does the inac- 
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cessible area, the effect of the potential on the electron weakens and 
hence the phase shift gets smaller. Clearly, the inaccessible area 
should be smaller for higher-energy electrons. 

Although the centrifugal potential is of a purely classical nature, 
it leads to a very interesting quantum mechanical consequence. 
When r— 0, the total potential Uert = V (r) + U(r) œ +r, 
when r—> œ, Uep œ —r™. Obviously, Uep has a minimum (see 
Fig. 2.4a). If, however, V (r) decreases faster than r~? at r— oo, 
then Uet œ +r? which means that the function Uett (r) must 
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- _— mM 
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Fig. 2.5. Characteristic contributions to the crystal potential. The example 
shown is Pt [52]. 


also have a maximum, as shown in Fig. 2.4b. The quasi-discrete level 
from the above discussion may then be viewed as a resonance state 
in this potential (Fig. 2.4b). 

The actual relationship between the characteristic contributions 
to the crystal potential is shown in Fig. 2.5 for Pt [52]. We can see 
both the radius of the sphere inscribed within a Wigner-Seitz cell 
(“muffin-tin” or MT sphere and hence the MT-radius) and the radius 
of the Wigner-Seitz sphere. The atomic potential is shown for com- 
parison. The shaded area indicates the approximate location of the 
d band levels. 


2.2.8. Friedel’s sum rule. We have thus established that depending 
on the energy of the scattered electron, the potential may act either 
attractively or repulsively. It would thus be appropriate to charac- 
terize a potential by the difference between the total number of 
electrons it scatters and the number of electrons that would be present 
in the absence of the potential. To evaluate this parameter we employ 
Eq. (2.71), which yields the number of electrons with energy Æ that 
are contained within a sphere of radius R. When R is large enough, 
we can use the asymptotic value (2.52) with the normalization 
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’ 


R 
fn Ri(r, E)r? dr 


L 


4n œz 1 dy 1 |. la 
=- (constr) oF | R4+ SL zy Sin (2 (xR-—+n,))]. 
Setting n; = 0 we obtain the number of free electrons formerly 
within the sphere. Subtracting one from the other, we have 


Api (E) = [ t (E) — F" cos (2xR— ia +n (E) ]. 

This is the number of electrons that were in the state with energy 
E. To fnd the total number of electrons scattered by the potential, 
we must multiply Ap, (E) by the unperturbed density of states, sum 
over all J, and integrate the result over the energy range from 0 to 
Ey. Taking spin into account gives 


EF 
AZ = 20 \ V E J, (21+ 1) Ap, (E) dE = [F (Er) — F (0)) 
0 l , 
E$ , 
—+ | > (21-+ 1) ae cos (2xR—In-+y,) dE, (2.90) 


l 


where we used the Friedel sum: 


F (E)=Ž J (214-1) (E). (2.91) 
l 


The integral in Eq. (2.90) cannot be carried out analytically unless 
the yn, (E) function is specified, but at large R it can be shown to be 
proportional to 1/R. This can easily be verified for small phase 
shifts. 

If a potential does not produce bound states then, by the Levinson 
theorem, ¥ (0) = 0. If it does, then the choice of energy zero becomes 
important. If the bound states are not relevant to the problem, the 
energy zero should be chosen so as to place them below the energy 
range of interest. We may then use the degree of freedom in the def- 
nition of the phase shifts to subtract nN, from each shift. This, ina 
sense, is a change from the original potential to another potential 
(or, more properly, pseudopotential) that has no bound states. But 
then, by the Levins n theorem, F (0) will again go to zero. Thus, 


AZ = F (E$). (2.92) 
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Equation (2.92) is usually employed for impurity studies in crys- 
tals. In this case, AZ is, by definition, the difference between the 
numbers of valence electrons in the host and impurity atoms. The 

* in (2.92) is the matrix Fermi energy and (2.92) is then a certain 
condition imposed on the phase shifts for impurity scattering. This 
condition is known as the Friedel sum rule. Using this rule in the 
form of (2.90) we can find (provided we have correct phase shifts) 
a quantity R which may be regarded as the effective impurity radius 
or the maximum distance at which the impurity potential is still 
felt. A knowledge of the effective radius makes it possible to find 
the maximum impurity concentration beyond which the impurities 
start interacting with each other to form impurity bands (at which 
point the alloy may no longer be considered a dilute solid solution). 

An impurity atom is sometimes modeled by a square well with 
a depth chosen to satisfy the Friedel sum rule, Eq. (2.92). The phase 
shifts so obtained are then used for calculating the residual resis- 
tance due to impurities. The results [53-56] were in qualitative agree- 
ment with experiment. The Friedel sum rule turns out to be quite 
a reliable computational tool, even if the integral in (2.90) is dis- 
regarded. 

Now it is natural to ask what the Friedel sum is for a pure crystal. 
On the one hand, this must be the difference between the valencies 
of the host and impurity atoms—which is zero. On the other hand, 
this is the number of electrons that, in the NFE model, must be 
removed from an atom in the ion formation process, to be given 
back when the ion enters the crystal. This means that the Friedel 
sum is by no means zero but rather equals the valency of the atom, Z. 
With this approach, the Friedel sum rule is also valid for an impurity. 
We thus encounter a paradox whose resolution will be postponed 
until Sec. 3.5. 

Since the integral in Eq. (2.90) is an alternating sign function of 
R, AZ oscillates with R. To see how this arises, let us consider the 
r dependence of the /th component of the electron density. We start 
from the free electron case. 

Far from the origin, the number of electrons with a given / may 
be written 


EF 
Ptree (r)~ | V Ej (xr) dE 
0 


== (r VER = sin (2r V E}—6,) — + sin ô), (2.98) 


r 


where we have introduced the phase 6, = (1/2) ln. Apart from the 
leading term, which is of the order of r~?, there is an oscillating addi- 
tive term that decreases as r^. 
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Now if a scatterer is placed at the origin, Peat is given by an equa- 
tion like (2.93) but with the 6, replaced by (1/2) in — nı, hence 


Ap! (r) = Pscat — Ptree 
(—1) .. , To a 10 
~ sin y, [cos (2r V E + n) -cos n (E$)]. (2.94) 


r3 





The excess density due to the presence of the scatterer falls off 
as r™ and has a term that oscillates with distance, being known as 
Friedel oscillations. It is this term that causes the AZ oscillations. It 
should be remarked that Eq. (2.94) also contains a term which does 
not oscillate but which is disregarded in most books on band theory. 
A similar result was obtained in [57]. 

The density distribution as given by (2.94) is associated with the 
Coulomb potential Ve (r). The potential must also affect the electrons 
including those producing it. We have thus started with a finite- 
range potential and arrived at a long-range one (see Fig. 2.4c). Our 
calculations thus lack self-consistency. What is more, we can no 
longer, with the potential Va, consider n; to be an assumption used 
in the derivation of Eqs. (2.90) and (2.94). 

That there can be no short-range potentials in an electron gas is 
important in solid state theory, for analogous potentials are the 
ones used to model all interatomic interactions. We shall see in 
Sec. 3.2 that the interatomic interaction potential of Chapter 1 
also yields characteristic Friedel oscillations. 


2.3. Pseudism and scattering 


From now on we shall frequently use the word “pseudism” as pro- 
posed in [17] as a general term to denote all phenomena associated 
with pseudopotentials. 


2.3.1. Pseudism and the periodicity in the properties of the chemical 
elements. For the sake of simplicity, consider a one-dimensional square 
well of depth V,. The wave function in this case is a plane wave 
exp (ikz), for which VE = k outside the well and inside k = 
VE + Ve. Thus the wavelength inside the well is shorter (and the 
oscillation frequency higher) than outside. This then is the reason 
for a path difference (or phase shift) between an clectron surmount- 
ing the well and the free motion casc.* 

A rise in the oscillation frequency indicates the greater wave 
function curvatures and hence larger V?W. Thus oscillations reflect 


* The effect of bound states on scattering is for the moment neglected; we 
shall discuss it later. 
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increases in the electron’s kinetic energy due to increases in the 
(absolute) magnitude of its potential energy. A phase shift is thus. 
a manifestation of the virial theorem (for more details, see [29]). 

Now if the well is made deeper the curvature of the oscillations. 
increases, the time delay gets longer and the scattering stronger 
until at a certain value Vi? a zero-energy bound state is produced. 
The situation may then be described by saying that the level “traps” 
the electron and “holds” it for a time (hence the time delay) the 
duration of which is related to the energy of the level by 


t, œ (E -| E |). (2.95) 


Further increases in V, push the level in the well to lower energies, 
the quantity t, in (2.95) is made smaller and the scattering weaker. 

When the potential becomes strong enough to contain another 
bound state, the scattering becomes stronger and the situation is 
repeated. As bound states with new orbital numbers are produced, 
the scattering process involves higher partial components of the 
plane wave. Resonances will be observed for the ? > 0 components. 

lf the square-well atomic model is adopted, atoms with larger 
nuclear charges will be approximated by deeper wells. By varying 
the well depth, potentials for all the chemical elements may be model- 
ed. 

The variation of the phase shift with well depth (for square-well 
scattering) is illustrated in Fig. 2.6. The figure shows the wave func- 
tion in the absence of scattering (Fig. 2.6a), weak scattering by a 
shallow well (Fig. 2.66), strong scattering by a middle-depth well 
(Fig. 2.6c). Finally, Figure 2.6d shows that as the well is deepened 
still further, the scattering may again become as weak as that in 
Fig. 2.60. If the function’s behavior inside the well is ignored, then 
Figure 2.66 and Figure 2.6d are the same. It can be seen that the wave 
function is “pulled in” as the well gets deeper. 

In Fig. 2.6d the shallow potential of Fig. 2.66 is also shown. The: 
shaded area affects the magnitude of the formfactor by contributing 
to the integral, but it has no effect on the scattering or, to put it 
another way, on the formation of the crystal energy spectrum. 

The above discussion is important enough to deserve a review. As. 
illustrated in Fig. 2.6, the periodic change in the scattering pattern 
with increasing well depth is associated with the wave function 
being “pulled” into the well, or, in other words, with the phase shift 
being only determined modulo x. This indeterminacy suggests, as- 
we have already remarked, that the true potential may be replaced 
by a pseudopotential. On the other hand, the periodicity in the 
strength of scattering implies periodicity in physical properties. 

The NFE model depicts a metal as a gas of free electrons scattered 
by an atomic lattice and, as follows from the factorization of the 
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matrix element, Eq. (1.21), the interaction of an electron with a given 
atom is, in effect, independent of the rest of the atoms. 

Around each of these atoms oscillations in charge density (see 
Sec. 2.2) and hence in screening potential will arise. Each atomic cell 
is, by definition, electrically neutral. It then follows that the inter- 
atomic interaction, which is the sum of a direct term (Coulomb 
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Fig. 2.6. Variation of the phase shift with the depth of the square well. 


repulsion) and an indirect one (the interaction via conduction elec- 
trons), reduces to the indirect term alone, i.e., to the oscillating 
potential Ve. 

Suppose that at some moderate distance r, from a given atom (A) 
there is another atom (8). Generally, the atom B experiences a force 
Fit = —grad Va. Depending on the sign of Faş, the atom B will 
be either attracted to atom A (the equilibrium separation a is then 
smaller than the initial ry) or repelled from it, whence a > ry. As the 
atomic number of/A increases (or the square well deepens) the quan- 
tity Ve (r = ro) oscillates, since the wave function is “pulled in” 
by the atom A. Consequently, Fat changes sign. This means that 
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the equilibrium interatomic distance (the lattice parameter) is an 
oscillating function of atomic number, a phenomenon which has 
indeed been observed [58-60]. 

In the same manner pseudism manifests itself in the oscillatory 
behavior of other macroscopic crystal properties. The moduli of 
elasticity, for example, are determined by the slope of V,; at atomic 
sites (i.e., by the elastic forces F+) and therefore oscillate with in- 
creasing atomic number. The same forces determine the elasticity 
of a vibrating lattice, that is, the phonon characteristics of crystals. 
For example, the Debye temperature varies periodically with atomic 
number [58-60]. 

Since, by the Lindeman criterion, the melting point is related to 
the Debye temperature [3], some correlation should exist between 
the melting temperature and the cohesion energy. This is indeed 
the case [58]. 

The same elastic forces determine the thermal expansion coefficient 
which, accordingly, also shows periodicity (for example, its correla- 
tion with melting point is reported in [58, 61]). i 

The concepts of pseudopotential and of a “pulled-in” wave function 
have thus proved to be effective tools for interpreting crystal prop- 
erties (for more examples, see [17)). 

We pointed out in the Introduction that even the periodicity in 
the properties of the elements can be readily explained in terms of 
pseudism. 

As the atomic number increases, the outer shell is gradually filled 
by electrons until at some moment it becomes a “core” shell. The 
atomic core is thus periodically re-defined, which means that the 
potential felt by a valence electron is repeated within the periodic 
table (with some variations in detail though). 

It should be emphasized that the mere existence of bound states 
does not necessarily imply that the scattering is strong. It is how- 
ever strong if (a) these states are close to the zero energy (see Eq. 
(2.95)) or (b) they result from a small variation in the potential (vir- 
tual states in the terminology of [39]) or (c) the scattering potential 
has quasi-bound (or resonant) states. The last condition follows 
from the definition of the time delay, Eq. (2.88), combined with 
Eq. (2.79): 


T 

Tı (E) =t (E) +2 -p HTT 
The time delay is given by the classical Breit-Wigner formula and 
is therefore a Lorentzian curve superimposed on the smooth back- 

ground due to the potential scattering with phase ni (E). 
Thus, from the point of view of scattering theory not every poten- 
tial with bound states gives large corrections to the free-electron 
dispersion law Æ = k? (see Eq. (1.31)). But then a paradox arises: 


(2.96) 
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on the one hand, these corrections clearly must be small, on the 
other the corresponding perturbation expansion must diverge. 

It is intuitively clear that the shaded area on the potential curve 
of Fig. 2.6d, while not contributing to the phase shift, must lead 
to the divergence of the perturbation series. This area may not sim- 
ply be “cut off’, and the origin of the phase shift should therefore 
be discussed in more detail. 


2.3.2. Phase function method and pseudopotentials. Let us define 
the phase function n; (r) for all r by demanding that the solution 
#, of the Schrodinger equation have the form (2.45) for all r, the 
phase shift n; (r) being defined as in (2.48). The ambiguity due to 
the appearance of two unknown functions (C, (r) and S, (r)) instead 
of one (F, (r)) can be removed by imposing an additional condition 
on .#?,. It is convenient to write this condition in accordance with 


(2.65): 
ZA =O) aar) Si (r) Sur). (2.97) 


Then, by substituting (2.45) into (2.25), and expressing C; in terms 
of Sı, we obtain differential equations for C, and S, leading, in 
turn, to an equation for the phase function 1), (r): 


S tann, (r, E) = — xV (r)r? [ji (xr)— tan n, (r, E)n, (xr)]?. (2.98) 


This is the basic equation for the quantum-mechanical phase-function 
method [27, 28]. 

By performing mentally a numerical integration of Eq. (2.98) 
we can get an idea of the r-dependence of the phase shift. 

Let the energy of the electron be close to zero. At r = Q the right- 
hand side of Eq. (2.98) is zero. From Eq. (2.47) it follows that 
nı (r = 0) = 0. Increasing r means that increasingly more regions 
of the potential are made to participate in the scattering process, 
causing thereby a gradual increase in the phase. 

At a certain value of R,, the “already integrated” region of the 
potential produces a zero-energy bound state (Æ, = —0) on which 
the incoming electron (E, = +0) is resonantly scattered. The right- 
hand side of Eq. (2.98) (which is the derivative (d/dr) (tan y,)) must 
then be large. At the point where the bound state “appears” the phase 
shift jumps. In accordance with the Levinson theorem (2.87) the 
phase must be nV). 

As the integration goes on, the phase shift jumps each time a bound 
state with a given l “appears” (see Fig. 2.7). 

When, for example, the potential well defined by the curve be- 
tween the points —oo, V), R{'’, gives rise to a zero-energy bound 
state, then, by the Levinson theorem, yn; = x. We can introduce 
three pseudopotentials (i = 1, 2, 3) which lead to the same (modulo x) 
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small phase. Each pseudopotential is zero for r< R\” while, for 
larger r, they coincide with V (r) from V on. The most suitable 
potential is i = 3, shown by the 
bold line. 

If the scattered electron has 
a nonzero energy, the jumps in 
phase are rounded off, which 
means that the positions of R, 
depend on the electron’s energy. 

Instead of the true potential, 
we may consider, at a given en- 


ergy, a corresponding pseudo- 
potential: 

0, r<R,(£), 

Wilt, D={ yoy r>R,(E). 

(2.99) 


This may be seen by noting 
that, according to (2.98), the 
phase shift does not depend onr 
in the zero potential. For both 
the true potential V (r) and the 
pseudopotential W, (r, Æ) the 
scattering phase shift accumulat- 
ed to the point r = R, is aN, 
and the wave functions Y and 





Fig. 2.7. Above: the build-up of phase 


AW at this point (as well as 
their derivatives) will coincide. 
At rœ R, the phase equations 
for the potential and pseudopo- 
tential will be the same, which 


shift with gradual integration of the 

radial Schrodinger equation. Below: 

the potential V of the radial equation 

(normal line) and the pseudopotential 
(bold line). 


means that the wave functions 

will be also the same. We conclude that the removal of the core 
region of the potential does not alter its scattering characteristics. 
This allows us to introduce a pseudopotential [62, 66]*. 


* The variations in R; with energy turn out to be small. For Lit, Nat, 
and K+ they are negligible [63], for di- and trivalent elements they are larger. 
For Mg++, for example, they are 1.4% for the ! = 0 electrons within about 
0.47 Ry of the Fermi energy (as calculated for Mg in the NFE approximation). 
For Cat* the variations in R; -o are 3.3%, in Ry, about 1.7%. The variations for 

Al*++ turned out to be smaller than those for Ca++ [64]. Note that the calcula- 
tions were carried out for ions rather than for neutral atoms. To our knowledge, 
the phase function method has not yet been employed in pseudopotential theo- 
ry. 
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The qualitative ideas developed at the beginning of this section 
may now be complemented by the following remark which is obvious 
in the context of the phase function method. Even though elements 
in the same groups but in different periods have identical valence 
shells, they differ in their pseudopotentials because of the different 
numbers of bound states. Therefore, the elements in a new period are 
always different (however little) from those in the old one. 

In the 3d transition metals, for example, the d electrons are affected 
by the whole of the potential. As a result, the width T, of a resonant 
level will be smaller in 3d metals than in 4d metals, which have 
core d levels. Since I’; is a measure of the disturbance felt by the 
electrons (see Sec. 2.2) the d bands of 3d-metal will be narrower than 
their 4d counterparts. On the other hand, narrow bands favor ferro- 
magnetism and it is the 3d metals Fe, Co and Ni which are ferromag- 
netics. Note that these are end-period metals and their Æ, level is 
therefore low (by the argument of Secs. 2.2.7 and 2.3.1, #, moves to 
lower values along a d period becauseJas Z grows this level must be 
bounded, so that T’, also decreases toward the end of a period 
(Eq. (2.82)) and the d bands get narrower. For metals of other d 
periods, T; is greater and, accordingly, they are less inclined to 
ferromagnetism. Amongst the elements of a given d period the most 
probable candidates for ferromagnetics are those at the end where 
T, is relatively small. Both these conclusions are in good agreement 
with experiment. * 

It can be easily shown [65] that the wider the band resulting from 
an atomic (or, in this case, quasi-atomic) level, the higher is the 
cohesion energy of the metal. Since d bands get wider from 3d to 4d 
and then to 5d metals, the cohesion energy should increase from one 
period to another. This is also confirmed by experiment [58, 60]. 

The pseudopotential (2.99) shown in Fig. 2.7 is not%the only one 
to derive from the phase function method. Instead of an “empty 
core” with an energy-dependent radius, we could also consider a 
square well with an energy-dependent depth. Then 


— A, (£), rS Rn 
V(r), r> RR). 


The well depth A; should be chosen for each energy by matching, at 
r =: R, the inside and outside solutions: 


W,(r, E)= | (2.100) 


ji(Ri V E+A((E)) 
7 r=R,, EB). (2.104 
jı (Ri V EF A, (6) i( n Æ) ) 


The only thing, in fact, needed to carry through this procedure 
is to know how to construct the potential “seen” by the electrons in 





* A similar, if more complicated, situation occurs in f metals where rela- 
tivistic corrections seem to be of importance. 
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the crystal. A suitable procedure may be borrowed from the band- 
calculation method and is based on the use of secular equations (it is 
described in Sec. 3.5). The calculation of the function 4, (E) and 
determination of A; from (2.101) present no computational difficul- 
ties. We can therefore introduce a model pseudopotential that auto- 
matically incorporates the free-electron-gas screening of the atom 
(since screened potentials are used in band calculations). 

For a potential of (finite) range R, an equivalent square well may 
be constructed by using the matching procedure of (2.101) at r = R. 
The computation of the corresponding formfactor is then drastically 
simplified since there is no need to do a numerical integration out- 
side R}. 

Finally, it should be noted that applying the above argument to 
the NFE model makes the construction of a pseudopotential partic- 
ularly easy since outside the ion core the V (r) function has, of 
course, the form (1.26), where Z is the number of valence electrons. 
For such a potential, the solutions %2, are known: they are Coulomb 
functions [67, 68]. The right-hand side of Eq. (2.101) becomes analyt- 
ical and the evaluation of A, is then straightforward. 


2.3.3. The Lloyd pseudopotential. Scattering theory allows still 
another type of pseudopotential, which will be important later. 

Consider the difference A; (ZH) — J, (E), where J, (E) is the 
logarithmic derivative of the electronic wave function in the absence 
of a scatterer. The difference logarithmic derivative 4, — J, reflects 
the strength of scattering. Thus we have (cf. (2.66)): 





/ 1 d , 
nE = |e ior) | y (2.102) 
M= h= (Rim MI (hI) = —zRegee (24103) 


Here use has been made of the normalization (2.55)-(2.57). Finally: 


R? (m=i) == | AER y (ry I pedr, (2.104) 
0 





ji (XR) Rı (R, E) 


This equation links the difference logarithmic derivative with the 
potential that determines it. This equation is for À, like equation 
(2.56) is for tan 7). 

It is clear that Eq. (2.104) is satisfied not only by the true V (r) 
but also by the pseudopotential 


Wi(r, E) = [A (R, E) — Jı (R, E) ò (r — R). (2.405) 


It can readily be seen that substituting (2.105) into both Eq. (2.104) 
and Eq. (2.57) leads to identities. 
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The pseudopotential (2.105) was obtained by Lloyd [69] in the 
framework of band theory, and we have also come to it in a discussion 
of the isolated-center scattering problem. The Lloyd pseudopotential 
suggests that a crystal may be described by models other than the 
LCAO or NFE. This can be seen by combining Eqs. (2.103), (2.67) 
and (2.57) to give an exact equation 


tan yj (E) 


hy (LE) — J (E) = ~ ER? HR) ” (2.106) 


in which we used the phase shift yn; introduced by Ziman [70], i.e. 
cot ni (EZ) == cot nı — n (xR) ji (xR). (2.107) 


Note that the phase nj describes the difference between the scatter- 
ing patterns produced by the potential V (r) and the hard-sphere 
potential (2.73). The more 1, differs from ß,, the larger cot yj be- 
comes and the less the pseudopotential (2.105) affects the electron 
gas. This disturbance is the greatest for a hard-sphere-like potential. 
In this sense, a hard sphere is the strongest scatterer possible and 
any deviation from it means less effective scattering. 

We can follow this approach to develop a pseudopotential theory 
for a strong scatterer. The first step will be to calculate the band 
structure for a crystal whose actual atoms are substituted for by 
hard spheres. Next, we will consider the departure of the scattering 
potential from the hard sphere model as a perturbation and correct 
the solutions. Clearly, the solutions for the initial model will only 
depend on the lattice type and lattice constant. The solutions can be 
tabulated and then easily recalculated to a new lattice constant. 

Certainly, the band-structure problem for the initial crystal is 
not easy, but the formalism is available: this is the secular equation 
of the Green function method (see Sec. 5.1) involving only the quan- 
tity cot nz (see Eqs. (5.3) and (4.56)). 


2.4. Bound states, pseudopotentials 
and the convergence of series 


2.4.1. Bargmann’s criterion for the existence of bound states. Scat- 
tering theory thus helps us in choosing an initial pseudopotential. 
We are still at a loss, however, as to why the perturbation series 
is divergent for a potential producing bound states. Furthermore, 
we cannot be confident that the scattering-theory pseudopotential 
may be treated by perturbation theory. In other words, we do not 
yet know how scattering theory is related to NFE-model perturba- 
tion theory. These are the questions we take up in this section (which 
may safely be skipped by a reader not interested in the subject). 
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First, let us establish a criterion by which to judge whether or 
not a potential can produce bound states. To do this, note that 
Eq. (2.59) for a phase function may be rewritten for all r by changing 
the upper integration limit from infinity to r. 

If the electron’s energy is zero, the appearance of a bound state 
leads to resonance scattering, that is, tan n; goes to infinity, which 
is only possible if 


ny (ur) V(r) By (r, E)r? dr= 1. (2.108) 


ote 4 


We introduce the nth moment of a potential: 


B, = | V(r)r"ar, B = \ V(r) rar, (2 109) 
0 


1 
Oty 8 


where V denotes the attractive part of V (r). 

If E goes to zero, .#, œ jı (xr). Substituting the asymptotic values 
(2.28) into (2.108) yields a condition for there being no bound states 
with a given / in the potential i.e. 


aF IB, (<1. (2.110) 
This expression is closely related to Bargmann’s theorem [28, 71], 
which states that the number of bound states N, obeys the condition 


1 ~ 
Ni< apy Bil. (2.414) 


Our next concern is to find out a relation between the formulas of 
scattering theory and those of NFE perturbation theory. In other 
words, we wish to reformulate scattering theory in such a way that, 
instead of phase shifts or logarithmic derivatives, it contains pseudo- 
potential formfactors. Then, hopefully, the formalism of scattering 
theory might be used to develop pseudopotential theory further. 

Consider the scattering-theory integral equation (2.4). It can be 
rewritten formally as 


Y= o+G,V¥, (2.112) 


where ¥ is the wave function to be found, V is the scattering poten- 
tial V(r) and G, is the free motion Green’s function (2.12). This 
equation is known in scattering theory as the Lippmann-Schwinger 
equation. 

Solving equation (2.112) by successive approximations (i.e. using 
perturbation theory), we obtain the Born series: 


V=@+GV (P-+GV¥)=9+GVp+GVoGVo+... (2.113) 
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This expression can be obtained by formally rewriting (2.112) 
in the form 


W=(1—G,V) 9 (2.114) 
and then expanding in powers of G,V. The convergence condition for 
this series is therefore given by the operator inequality: 

GV<i, (2.115) 


which is only meaningful when it operates on ọọ. The convergence 
condition for the perturbation theory approach in scattering problems 
is thus 


I(r) >| | Go eV) oO) a" |. (2.116) 


Note that |@|=1 because ọ = ett". The inequality (2.116) 
holds for any r, so that we can set r = 0 without losing generality. 
For a spherically symmetrical potential, a coordinate system may be 
chosen in such a way that the vector k points in the z direction. 
Integrating over angles, we get [6]: 


| | V (r) (ezik — 4) dr |< 2k. (2.417) 


otg 


At low energies (for scattering from an isolated center, Æ = k?) 
the exponential can be expanded. By making use of (2.109), we 
find 


V BEB: <1. (2.118) 


At E =0 we have thus obtained a convergence condition analo- 
gous to (2.110). Note that in Eq. (2.110) only the attractive part of the 
potential is involved, whereas in Eq. (2.118) the whole potential is. 

If the potential has a repulsive part, Eq. (2.110) cannot be used 
as the convergence condition for the Born series (2.113). 

We can easily construct a model potential such that, on the one 
hand, it will produce bound states, and on the other it will ensure 
the convergence of (2.113) by criterion (2.118). The reverse case is 
also possible and it is important for the potentials that are chosen 
to ensure smooth wave functions (.%7, does not oscillate and has no 
zeroes, hence there are no A; singularities at r< R and no bound 
states in the potential). Such a pseudopotential may have a signifi- 
cant repulsive part (equal, for example, to the average energy of the 
scattered electron): in this region the wave function is virtually 
constant and, by (2.118), the series will diverge. 

Note that 4nB,/Q, (cf. (2.109)) is, on the one hand, the average 
value of the potential V (r) and on the other, the long-wavelength 
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limit (q — 0) of its formfactor (1.23). It may therefore be said 
a priori that if 


k|Bo| > 1, ie., k\(k|V|k)| > 4a0/Q, (2.419) 


then the Born series will diverge.* 


2.4.2. Born series in scattering and perturbation theories. Our next 
concern is to demonstrate that the scattering theory in the form of 
(2.113) is equivalent to the perturbation theory in the form of (1.31). 
We consider the first Born approximation{ for; the wave functions: 


Y = eikt 4 l G (r, r’) V (r) eitt’ dsr’. (2.120) 
It is easily seen that by (2.11) 
| Go(e, x’) V(r") ee dr’ 


eia -r') , ser’ 3 eat 
= \ > Sage V (renr dir’ = D) oe GIVI). (2-121) 
q q 
It is convenient here to separate out the vector k explicitly by 
shifting the origin. Taking into account that the order of summation 
in (2.121) is immaterial, we have 


í Go(r, r) V (r) ek dr’ = S SE cetera, (2.122) 
q 


where the right-hand side is nothing but the first-order NFE wave 
function. 

We have thus proved that for weak potentials satisfying (2.115), 
scattering theory becomes equivalent to NFE perturbation theory. 
Since the convergence conditions are the same for perturbation theory 
and for the Born series (2.113), it follows that we have also proved 
the validity of (2.118) for pseudopotential perturbation theory: the 
series (1.31) diverges for an everywhere attractive potential having 
bound states. 


2.4.3. The Austin-Heine-Sham theorem. Let us prove now a remark- 
able theorem which shows that in Eq. (2.112) (or, equivalently, in 
Eqs. (2.113) and (1.30)) a part of the potential may be “thrown away” 
without invalidating the equation. 

Let ¥ in Eq. (2.112) be an eigenfunction of a continuum state of 
the Hamiltonian H = —V? + V (r). lt must be orthogonal to the 


* We shall see later that typically in pseudopotential theory (k | W | k) = 
— (2/3) E%. This means according to (2.119) that the perturbation expansion for 
a single scatterer can only converge if Z < (2/m) (kp/k) = 0.6 (kpk). For 
scattering in crystals, the situation is much more complicated. 
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bound (discrete) states Pa (r) of the same Hamiltonian, with quan- 
tum numbers a = {n, l, my: 


aY) = \ Dë (r) Y (r) dr =0, (2.123) 


We set up a potential involving a projection operator (see (2.40)) 
onto the bound states M,: 


pound — 5 UP, = >) Ula) (al. (2.124) 
By (2.123) we have 


proundyy o, (2.125) 


Consequently, if (2.125) is added to the right-hand side of (2.112), 
the equation will still hold, and we may write 


FY = ọọ + GWY (2.126) 
where we have introduced the pseudopotential W (r): 
W (r) =V (r)+ >) U, (r)|a) la]. (2.127) 


From the point of view of scattering theory, this pseudopotential 
has the same properties as the original one had (the resulting wave 
function is the same), but it may be much weaker by perturbation 
theory standards. To make this clear, we evaluate its formfactor 


(see Eq. (1.23)): 
(k+ q|W|k) = (k+q|V |k) + È (k+ qlU al) (ajk). (2.128) 
a 
Suppose we can separste out a region (of radius Re) where the 
functions P, are localized; this is in fact the ion core formed by 


bound states. Inside this core region the functions P, form a com- 
plete set in the sense that for r < Re andr’ < Re (cf. (2.10)) 


2 Dž (r') D, (r) =ô (t —r). (2.129) 


Let us define U,(r) as 
—Vi(r), r<R,, 
AOE TO 9 


As a result, we obtain 


(k+q|W |k) = (k+ q|V|k) + (k+ q|U|k) 


4 ° . . 
=a e-itk+ory (p)eikt d3r, (2.130) 
° outside the 

10n core 
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that is, the potential has indeed been stripped of all its interior 
region, which is responsible for the bound states! 

We have thus proved the following existence theorem for pseudo- 
potentials: an original potential may be modilied by adding a more 
or less arbitrary potential but from the scattering theory point of 
view it will remain the same. 

A similar theorem for conventional pseudopotential theory states 
that adding an expression of the form (2.124) does not change the 
eigenvalues of the Schrodinger equation. This is the Austin-Heine- 
Sham theorem [3, 72, 73]. 


2.4.4. Pseudopotential optimization. Because of the arbitrariness 
of Ua, a great many pseudopotentials are possible. The Austin- 
Heine-Sham theorem only tells us that a pseudopotential can be 
found but it says nothing about how it affects the electron gas. In 
the literature, it is often stated that an optimal pseudopotential 
should be chosen. One of the various optimization criteria is that 
there be no bound states. 

It follows from the argument in Sec. 2.3 that wave function oscil- 
lations inside the potential are due to the presence of bound states. 
But then if we require that there be no such oscillations and set 
up a corresponding pseudopotential, this will be suitable. If, further, 
we require that the function be as smooth as possible, the pseudo- 
potential will be the best achievable. This will also secure a good 
convergence of the perturbation theory series. 

This optimization criterion was suggested in 1961 by Cohen and 
Heine [74]. It may be applied both to model pseudopotentials and 
to the OPW pseudopotential (see Secs. 2.5 and 4.2). In the latter 
case the optimization procedure leads to a replacement of the un- 
known energy £ in Eq. (4.31) by an approximation [73] (Cohen and 
Heine, for example, derived a first-order dispersion law). Note that 
the difference between OPW pseudopotentials due to the difference 
between the initial dispersion laws can be ascribed to (or rather de- 
scribed by) movements of the ion-core levels £, caused by neighbor- 
ing atoms [75]. These displacements do exist and are sometimes 
quite large [76-78]. 

Unfortunately, the smoothness of wave functions is not an entirely 
satisfactory criterion. The point is that a potential which is weak 
from the point of view of perturbation theory may still not yield 
smooth wave functions. This was first noted by Pendry [79]. Con- 
sider the Schrodinger equation 


(—V2+V(r))¥ =EV (2.131) 
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where V is the crystal original potential. Let us replace V by an arbit- 
rary model pseudopotential (2.124): 


V—>V +p, (2.132) 
VRY => Ja) (Fal Y) (2.133) 


where F, is an arbitrary function and ]æ«) are core states. Now by 
substituting Eqs. (2.132) and (2.133) into (2.131), multiplying by 
W*, integrating over all space, and using the orthogonality of V 
(a conduction band function!) to the core states, we find that when 
calculating Æ (k) the result is the same as if we were dealing with 
the initial Schrödinger equation (2.131). Thus the Schrödinger equa- 
tion even with the best possible of the (2.132) type pseudopotentials 
leads to the same oscillatory crystalline function. 

Pendry showed [79] that the smoothness criterion is only valid 
for a non-Hermitian pseudopotential, for which the weakness of 
the perturbation turns out to be uniquely related to the smoothness 
of the wave function. The optimization criterion he proposed can 
be described briefly as follows. The replacement of the true potential 
by a general pseudopotential (2.133) displaces the Hamiltonian’s 
eigenvalues including the core ones. The new core levels are 


Ey, = Ey 4 (F a). (2.134) 


This is simply the expectation value for the pseudo-Hamiltonian 


(—v?+ V + Vp) taken for the | a) states. 

The perturbation series will converge most quickly if the bound 
states are eliminated. Thus, for a pseudopotential having a single 
core state, a sufficient optimization condition is 


E,= 0, (2.135) 
which leads to the pseudopotential given in [79]. 
W =V—E,|a) (a). (2.136) 


This potential is energy-dependent and Hermitian. 

Some other criteria have also been proposed. In [80] all the criteria 
except Pendry’s are comprehensively discussed and yet another 
criterion is proposed, namely, the electron density deviates least 
from its mean value. On the one hand, this is closely related to the 
idea of a smooth pseudowave function, on the other it is associated 
with the pseudopotential that least affects the electron gas (the 
‘minimum pertrubation’ pseudopotential, see Sec. 3.5). Perhaps the 
best approach would be to minimize the deviation from the mean 
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of the crystal density (taking screening into account). Such a pseudo- 
potential would have a minimum effect on the electron gas. So far 
this criterion has not been used. 

In Sec. 4.1 we introduce Pendry’s pseudopotential (2.136) without 
explicitly optimizing it. 


2.0. Scattering theory and potential formfactors 


[It was shown in Sec. 2.3 that the true potential can be replaced 
by a pseudopotential exactly, approximations only come with pertur- 
bation theory. 

While scattering theory does not involve formfactors, they are 
used in the band theory applications of the pseudopotential method. 
In Sec. 2.4 we derived (2.122) with formfactors inside the sum. The 
formula has already done its job, and we shall now discuss the use 
of formfactors in scattering theory from a somewhat different angle. 


2.5.1. Seattering amplitude. Let us reconsider the integral equation 
(2.4). The Green’s function (2.12) may be approximated for larger by 


7 t 1 i Sn ay re? 
Go(t, r)= —qzo sem men, (2.137) 
where we have introduced a wave vector for a scattered state, i.e. 


“= % ——, (2.138) 


lr | 





Using (2.137) and denoting 9, = |x), we obtain from (2.4): 


ixr 
ct 7? 


[r] 





F, = p+ f(x, x’) (2.139) 


t ¥ Q , 
f(x, we) = —A (VV), (2.140) 


where the factor 2 comes from the definition: | x) = Q-1”2 x 
exp (ixr). 

Equation (2.139) is a typical example of the “mixing” of an irregu- 
lar solution (at r — 0) with a regular one. The strength of scattering 
is characterized by the function f (x, x’) which is known as the scat- 
tering amplitude [38, 42]. This is a fundamental concept of scatter- 
ing theory. 

Let us define the lth component of the scattering amplitude (or 
the partial scattering amplitude) f; as 

f(x, *')= >) (2144) f, (E) P, (cos By.) (2.141) 


L 
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where we have used the normalization (2.56) and the fact that x? = 
(x)? = E. It can easily be seen that f, (E) is defined by 


trm 1 
fi (E) = VE tan n; (E). (2.142) 


If F, is taken in the complex (i.e., running wave) representation, 
the quantity fı becomes complex and is determined by Eq. (2.85). 
This is what is customarily done in works on scattering, although 
the right-hand side of (2.142) is then called the AK-matrix instead 
of the scattering amplitude; the reader should not be confused by 
this conflicting terminology. 

We now return to the Born approximation. It is clear from Eq. 
(2.140) that by replacing V,, by @, we have introduced a potential 
formfactor into scattering theory. If somewhat loosely, the Born 
approximation may be interpreted as replacement of the tangent of 
the scattering phase for a given potential by the formfactor of the 
potential: 


P(x, w) = Ew’ IVx). (2.143) 


Equation (2.140) holds for both weak and strong scattering, but 
to go from (2.140) to (2.143) is only possible for weak scattering. 
In order to justify the pseudopotential method in terms of scattering 
theory, a formula with a formfactor must be derived for strong scat- 
tering as well. How is this possible? 


2.5.2. The t-matrix. Let us define an operator t such that its action 
on an unperturbed function ọ* gives the same result as that for the 
operator V acting on the perturbed function F: 


tox =V Fy. (2.144) 
The scattering amplitude can then always be expressed in terms 
of a formfactor: 
f(x, x)= — (x! itia). (2.145) 
Obviously, t is a generalization of the concept of the pseudopoten- 
tial. 
The formfactors (x’ |t |x) form the ¢-matrix (though the same 


name is also applied to the t-operator). 
The t-operator obeys the integral equation 


tpu = V (Px + GoV Vx) = V Pu + VGot en (2.146) 
which can be solved by perturbation theory: 
t = V + VG,V + VG, VGV +... (2.147) 
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In view of (2.144), the Born approximation for t coincides with the 
original potential: 


t = Y. (2.148) 
By en a partial t-operator t, we obtain from (2.145) 
(x |t|x) = —— = a (21 +1) tan nP; (cos Ox, x) 
= >) (2141) t, (E) Pı (cos 0x, x’), (2.149) 
l 
4n 4a 4 
t (E) = — on — tan nz (E) = — Ox cotn, e (2.150) 


In the standing-wave representation, the t-matrix has singularities 
due to tan nı. In the running-wave representation, these singularities 
are only important for bound states since, by (2.145), the t-matrix 
has the form (2.85): 


A; 


z An 1 
1B) = — oe 


Ox Totn,— i ' (2.151) 


el sin nı = — 


This equation only has singularities when cot n; = i, that is, 
according to (2.84), when bound states arise. 

Unfortunately, in the running-wave representation the t-matrix 
is not Hermitian 


(x |t |x) Æ ((ult|x’))*, 


even though with standing waves it is (cf. (2.150)). 

We thus run into difficulties in trying to use the t-operator as a 
pseudopotential, since in the standing-wave representation the 
perturbation series may diverge because of the t-matrix singularities 
while in the running-wave case the band structure energy (1.40) 
becomes a complex quantity. 

We thus have the paradox that although pseudopotential theory 
is clearly a special case of scattering theory, we do not yet understand 
how to use the more general form (see also [17]). 1t is for this reason 
that pseudopotential theory is as yet a distinct discipline. 


2.5.3. The OPW pseudopotential in scattering theory. The difficulties. 
in using the t-matrix as a perturbation pseudopotential arise because 
the ¢-operator is actually an exact solution of the problem. If the 
solutions were modeled the operators could be Hermitian at each 
step of the procedure (as may be seen from Eq. (2.147)). 

Let us consider therefore the model approach to the scattering 
problem. 
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It can be seen from Eq. (2.148) that in the first Born approximation 
the ¢-matrix coincides jwith the potential. In Sec. 2.3 we used an 
integral equation to lintroduce the scattering phase (2.56) expressed 
in terms of #), the lth {component of the true wave function ¥. In 
the first Born approximation V is replaced by the plane wave: 


wat ebr, (2.452) 


Vo 





Consequently, the Z, in the definition of the phase is replaced by 
j, and we get 


tan n= —x \ ji (ur) V (r)r? dr. (2.153) 


There are two ways we can improve on the first Born approximation. 
We can either include higher terms in (2.147) one by one or improve 
the zeroth order approximation (2.152) by somehow including some 
information about the potential. The remainder will then act as a 
perturbation. The better we make the zeroth-order model, the smal- 
ler the perturbation will be and the more workable the pseudo- 
potential. 

This is another illustration of an approach generally used in phys- 
ics: an initial model is chosen so as to make the subsequent per- 
turbation treatment most effective. (In band calculations, for exam- 
ple, this occurs when choosing the best trial functions with which 
to form the secular matrix.) So can we improve the approximation 
(2.152)? 

Let us consider elastic scattering on a potential containing bound 
states; these latter are occupied and therefore unavailable for elec- 
trons. The true wave function V is the continuum state of the same 
Hamiltonian whose discrete spectrum contains the bound states. 
We can therefore require that the ¥ function be orthogonal to the 
inner states DM, (a being a set of quantum numbers n, l, m). The 
effect of the potential will then be taken, if partly, into account. 

The required zeroth approximation will thus be an orthogonalized 
to the inner states) plane wave (OPW): 


et 4S) pa, KO, (1). (2.154) 


a 





Xk = Vo 


The coefficient ux. is chosen from the orthogonality condition 
| DE (r) xe (r) ar =0, 
which gives 


tle, k= -y \ Dž (r) ekr d3r = — (ajk). (2.155) 


vl 
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The lth component of (2.154) with respect to k and r is 
Rar) & ji (xr) +È tn, pe, oa (VE), (2.156) 


where the radial function ¢,; is defined as @,,,, (r) = 
oni (lr |) Vx (x). 
Substituting (2.156) into (2.56) yields [81] 


tan n, (E) = tan nP—% J) pnt | ier) VP) dmi (r)rèdr. (2457) 
This expression may be rewritten in a generalized Born form: 


tan ns (E)= —x È | seer) (rs ry) jiri) har dry, (24158) 


where’ we have introduced the nonlocal pseudopotential 


Wi, ry) =V Or) [SD dar) P(r) |. (2159) 


TT, 


This is the same pseudopotential that we obtained from the Austin- 
Heine-Sham theorem and to show this, it suffices to evaluate the 
formfactor of expression (2.159): 


W (r, r= È Wilts ra) V2.2) Y (13), (2.160) 


(k +al Wk) = (k +a|V k) — È (k +q|V |) (|k) (2.161) 


We observe once again that the (exact) scattering problem for the 
true potential may be treated as a Born-approximated scattering 
on a pseudopotential. 

Using the equation for ®, and the Hermitian property, we find 
that 


{k + q|V|a) 
= ({a|V* + E, |k+4))* = —(eg—#,) (k + gla), (2.162) 

{k +q|W |k) 
=(k+qjV|k)— >) (Ea — £a) (k +qlæ) (ajk) (2.183) 

a, 
‘The expression for the formfactor is now simpler but, as can readily 
be verified, the formfactor remains non-Hermitian. 

Interestingly, this pseudopotential is independent of energy. 


This is due here to the model we adopted for .#,: had the orthogonali- 
zation coefficient been energy-dependent, so would the pseudopotential. 


2.5.4. R-matrix pseudopotential. In the (approximate) formula 
(2.156) no account is taken of the phase shift: for r > Re, Re being 


5—01063 
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the ion core radius, £&, tends to j, (xr). A more rigorous treatment 
by Hubbard [81] employed artificial extra orbitals [82] This ap- 
proach was proposed in [83] and is now known as the R-matrix method 
[84, 85]. The idea is straightforward. If in the continuous spectrum 
the expansion in terms of bound-state eigenfunctions is impossible 
and hence the powerful method of trial functions (see Chapter 1) 
is inapplicable, we assume bound states to be introduced! To do this, 
the solutions of the radial Schrödinger equation (2.25) must be 
subjected to an appropriate boundary condition at the edge of the 
potential (cf. Sec. 5.2). The continuum will then acquire bound 
states, these will form the basis set to expand ¥ and the variation 
principle will then be used to find the best solutions. 

It is reasonable to specify this boundary condition by requiring 


that the ba orbitals smoothly join to the irregular solutions, i.e., 
tpherical Neumann functions: 
1 dv _ 1 2 4) 
Pa (R) T Palr=R rı (*R) ar n (2) r=R (2.164) 
where œ includes both / and an analog of the principal quantum num- 
ber, namely, the index of the level which satisfies (2.164). 


According to (2.164), the eigenvalues of this problem, Ea, are 
energy-dependent, that is, for each energy of a scattered particle 


there exists a special set of functions ¢, (E£). For a given E-value, 
the corresponding set is complete on (0, R) for functions that behave 
well at zero and smoothly join to n, (xr) at r ~- R. To appreciate 
the elegance of the boundary condition (2.164), note that .#; (r, E) — 
jı (xr) is one of the functions that smoothly join ton; atr = R, and, 





as such, it can be expanded in terms of ¢, (r): 
Alt, E)=jrery+S Caba (ts E). (2.165) 


There is a close analogy between (2.165) and the OPW as given by 
(2.156). Substituting (2.165) into the Schrödinger equation (2.25). 


and using the orthonormality of ¢,, we obtain 


Ca = FEL, (2.166) 
where 
R 
B= j ii(ur) V(r) å (r) r2dr. (2.167): 


0 
From (2.165), (2.166) and (2.56) we find 


tan n: (E) = tan n8 (E)— x Sil Bal? | (2.168) 
a 





E— Eu 


Ch, 2. Scattering theory 67 


This expression has a strong formal similarity to Eq. (2.80). The 
role of the resonance halfwidth is played here by the quantity 
x | Ba F, with a correct low-energy behavior of ~x?! +t! (cf. (2.82)). 
Note that E is a function of energy, in accordance with (2.81). 
Transforming, once again, the tangent of the phase shift to the 
generalized Born form (2.158), we obtain, instead of (2.159), a new 
pseudopotential 
— V (r) dalr) 64071) V (ri) 
W (r, r)=V (r) ee > ee ~ (2469) 


a 


with a Hermitian but energy-dependent formfactor 


k+ V Vik 
(k+ q|W ky =k + qV jk) — SERGI IF | (2,470) 
l a 
By transforming Ba, we may obtain an expression with orthogona- 
lization coefficients similar to those of (2.155). To do this, we use 


the equation for Par integrate in (2.167) by parts and take into 
account (2.164). This gives 


B,=1 aR ÐR, By | pri) redr ANY 


x n, (xR) 


Suppose, all the states ba of the original potential are deep, so that. 


at the edge of the potential Pa (R) = 0. It can easily be seen that 
the formfactor for such a potential is 


(k +q!W]k) = (k+ q\V 1k) — 2 (E,—E) (k-+ qa) (ajk). (2.172): 


In the general case, the original potential V contains both deep- 
and quasi-discrete levels, so that the a-sum in (2.170) includes both 
terms linear in (2, — E) and singular at E = E}, that is, propor- 
tional to (E, — E). 

It is interesting to note that (2.172) looks like a generalization of 
(2.163). 

We recapitulate our discussion. First, the Born approximation 
(2.152) (cf. (2.113)) was shown to be equivalent to the perturbation 
theory approach. Using Egs. (2.152) and (2.156), we then proceeded. 
by modeling the scattering on the original potential; this is a very 
important point for a proper understanding of how the theories are 
related to each other. When substituting these equations into the 
exact scattering-theory expression (2.56), we required that it become 
a Born approximation or, in other words, that perturbation theory 
be applicable. It is this requirement that led us to the pseudopoten- 
tials (2.163), (2.170), and (2,172), 


5* 


68 Part 1. Pseudopotential methods. Theoretical principles 


2.5.5. Other pseudopotentials. The above procedure clearly suggests 
two more ways for constructing pseudopotential formfactors. One 
is to evaluate the Lloyd-pseudopotential’s formfactor (see (2.105)) 
directly. This will be the exact formfactor because perturbation 
theory will not be involved in the derivation. 

Alternatively, we can employ scattering-theory model functions 
as the functions in terms of which to expand the required wave 
function ¥; when substituted into the Schrödinger equation for ¥, 
this expansion will automatically lead to a pseudopotential. 

The first method gives rise to the KKRZ-formfactor (the form- 
factor of the Korringa-Kohn-Rostoker-Ziman method, see Secs. 4.3 
and 5.1), the second (using the OPW of (2.154)) leads to the OPW- 
formfactor (the formfactor of the orthogonalized plane wave method, 
see Sec. 4.2). Interestingly enough, the two approaches are generally 
considered to be entirely different whereas we have just seen both 
of them to originate from scattering theory. 

In Chapter 4 both methods are used and compared. 


Chapter 3 


Theory of potential 


3.1. Potential seen by an atomic electron 


3.1.1. Hartree-Fock equations. The motion of an atomic electrom 
is usually described in the Hartree-Fock approximation as outlined 
in Chapter 1. The Schrodinger equation for the ith electron takes 
the form [86, 87]: 


— VF; (r) + VEE, (r) = BEY, (1), (3.4) 
where 
A Z 2 p | Y; 2 
very, (r)= — [ Y; (r)+ | > \ ae d?r, | Y; (r) 





j 
WF (ry) Vi (r1) 

— > Y, (r) \ See (3.27 

3 


The first term in (3.2) describes the attraction by the nucleus. The 
second is for the Coulomb repulsion from the rest of the electrons; 
this is an ordinary multiplication operator where the summation is 
over all the quantum numbers of the occupied orbitals (principal 
quantum number z, orbital angular momentum /, magnetic momen- 
tum m, and spin quantum number o) or, for short, over all spin- 
orbitals. (At j = i we have what is called Coulomb self-action: 
the electron, as it were, repels itself.) 

The third term in (3.2), called the exchange potential, is again 
attractive and is an integral operator because the function W; ap- 
pears inside the integrand. The jth summation only includes the 
spin-orbitals of the same spin as the orbital V;. The term is known 
as the exchange self-action term. 

It can readily be seen that the j = i terms in the second and third 
sums of (8.2) are equal in magnitude but opposite in sign so that in 
the Hartree-Fock approach the Coulomb and exchange self-actions. 
cancel each other. A valence electron of a neutral atom may thus be 
said to move in the field of a singly-ionized atom, the potential it 
feels being produced by all the electrons except itself. 

Equation (3.2) can be written for each function W;. Each WV; 
depends on all the other orbitals. Equation (3.1) is solved iteratively 
until self-consistency is attained. 
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3.1.2. Orbital relaxation and electronegativity. Suppose an electron 
moves from one orbital to another. Since one of the spin-orbitals 
is thus changed, the potential (3.2) and, accordingly, all the other 
functions will also change. The change in atomic orbitals due to the 
transition of an electron from one to another is called orbital relax- 
ation. 

That electron transitions are accompanied by orbital relaxation 
is a major point in the physics of the atom. It is usually said that the 
energy needed to move an electron from the ith orbital (energy F;) 
to the jth (energy #;) is A = E; — E;. This, however, is only true 
if the energy of orbital relaxation is neglected (this energy is of 
order 1/Z and may therefore be quite significant). The statement is 
known as Koopmans’ theorem [88-90]. 

Orbital relaxation is of particular importance in light and hence 
low Z atoms. This is clearly illustrated by hydrogen. The hydrogen 
electron has an energy of 1 Ry. Supposing an HM atom captures 
another electron, what energy will be released? 

With the orbital relaxation ignored, the number is: 1 Ry. The 
correct answer is 0.055 Ry [91], an order and a half less. 

The effects of orbital relaxation are exceptionally important in the 
study of chemical bonds in a molecule or crystal. Consider, for 
example, the formation of the palladium hydride (PdH) crystal. 
One of the two atoms will “pull” the electron charge onto itself. But 
which atom? The hydrogen level is lower than the upper vacant 
level in palladium, hence an electron transition from hydrogen to 
palladium is energetically unfavorable. But so is the reverse transi- 
tion, since the hydrogen level would then be raised above palladium’s. 
Still the atoms are different and the electron charge is bound to move 
one way or the other. 

The ability of an atom to hold an extra electron may be character- 
ized by half the sum of the electron’s energy in the neutral atom, J 
(i.e., the ionization potential of the nautral atom) and its energy in 
the negative ion, S (i.e., the ionization potential of the negative 
ion). This quantity is called the atomic electronegativity: 


g = (I + S)2. (3.3) 


Hydrogen, for example, has J = 13.6 eV and S = 0.75 eV, hence 
& = 7.18 eV = 0.528 Ry % I/2. l 

If two atoms react, the electrons will move toward the more elec- 
tronegative one. There are several definitions of electronegativity 
now in use; the one expressed by (3.3) is due to Mulliken [92]. 

Note that we have implicitly introduced the notion of a fractional 
electron charge. The electron is thought of as a liquid that can flow 
from one atom to another thereby equalizing the atomic levels. The 
concept of electronegativity is thus closely related (although not 
identical) to that of chemical potential. 
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For atoms and molecules, the “liquid electron” is perhaps too 
rough a notion since their wave functions and, accordingly, their 
densities have atom-like oscillations or, in other words, the density 
of the electron liquid is highly nonuniform. For crystals, however, 
this model may prove quite reasonable, the more so since it forms, 
in fact, the ideological basis for the NFE model. 


3.1.3. Statistical approach. The atom-like nature of the wave 
function may be taken into account (if partially) by replacing V 
by the @ model functions such that the integrals in (3.2) become 
analytically computable. The answer is rewritten in terms of the 
model’s electron density Omoa (r), which is then replaced by the 
true density pọ (r). We thus take into account both the liquid-like 
nature of the electrons and their localization. 

If the model functions are taken to be plane waves, we shall actual- 
ly be dealing with an electron gas and shall be led to the so-called 
statistical model. This model, however, is not to be confused with 
the Thomas-Fermi approximation. In the latter, the (potential-form- 
ing) electron wave functions are (or remained) plane waves, whereas 
in the statistical model they are nearly Hartree-Fock atomic orbitals. 
“Nearly’—because the potential seen by the electrons is not a Har- 
tree-Fock one. 

In the past 20-25 years, the statistical approximation has become 
very popular and is now used in about 90-95% of all first-principle 
band-structure calculations. The reason is that the approximation 
dramatically simplifies the potential (3.2) while the error it intro- 
duces into the band structure is small. 

The value of any simplification depends on the extent to which 
the calculations are made easier. Looking at Eq. (3.2), which terms 
should be simplified? 

Let us rewrite the potential (3.2) as a multiplication operator: 


Ze? | ‘Py (01) |? 
v= H+ 3 | a en 


r 
3 





Y; Wry) F; 
_ 5 g 7 (Pi) F: (ra) d3r,. (3.4) 


— W; (r) | r—r; | 

I 
In the Hartree-Fock approximation, a potential depends on the 
state it acts upon. This dependence is due to the exchange term. 
What is more, the exchange potential V°* is singular where V; (r) = 
0, which makes it very inconvenient to use as a local potential. 
Clearly, Ve* should be the first term to be simplified. In the sta- 
tistical approximation, this turns out to be a rather easy task. Let 
Y; be replaced by the plane wave | k;). The exchange potential (the 
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third term in Eq. (8.4)) then takes the form 


x 1 i —-k: f t J 
VE (x)= — D e f LT dry (3.5) 
j 


where the j-summation is over all the vectors within the Fermi sphere. 
The integration over r, is over all space, which allows us to change 
the variable while leaving the integration limits the same. Formally, 
the integral will take the form of a Fourier transform of the Coulomb 
potential, which we discussed in Chapter 1. Making use of (41.26) 
we get 





ex An 4 
EOS or a T (3.6) 


There being no preferred point in a free-electron gas, it is only 
natural that its exchange potential is thus independent of coordinates. 
(For a crystal, the translational invariance also makes this ap- 
proximation reasonable, although we should, in principle, take into 
account the nonuniformity of the electron gas near a nucleus.) 

For an atomic electron, the statistical approximation with its 
r-independent potential is unsatisfactory. We must therefore contin- 
ue with our program of transforming Ve* into a form containing 
the electron gas density (to be later replaced by the “true” density). 

We start by evaluating the sum in Eq. (3.6). Let us replace the 
k;-summation by integration. The coordinate axes in the k-space 
may be chosen so that k; will point in the z-direction. Changing te 
spherical coordinates, we get the integral 


R 





F 14 
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Fermi sphere 


The integral over d (cos @) will lead to one-dimensional k-inte- 
grals containing logarithms that can be integrated by parts. Using 
the substitution 











1 1— zr? 1 = 
F (2) = 4-43 n E, (3.7) 
(cf. (1.38)), we have 
Akg ki 
Vi=—- EP (= J. (3.8) 


This is a k-dependent potential. It has a singularity at k = kp 
which makes it inconvenient to work with and necessitates further 
simplifications of the exchange potential. 
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The first thing that comes to mind is simply to average V° over 
all occupied states (i.e., over the Fermi sphere). As a result, we come 
to the Slater approximation for the exchange potential [93]: 

k 
s=—3 a (3.9): 

If the potential is only averaged over the Fermi surface (i.e., the 
exchange interaction over the whole zone is assumed to be equal 
to that. of the Fermi-surface electrons) we obtain the Gaspar-K ohn- 
Sham exchange potential [94, 95]: 


sks = Va —2 AL. (3.10) 


The F (x) function decreases with z, so that the Gaspar-Kohn- 
Sham exchange is smaller than Slater’s. 

Since other averaging procedures are also possible, it seems reason- 
able to take this into account by introducing an adjustable para- 
meter & 

Vet = — 304, (3.44) 
For œ = 1 we retrieve the Slater exchange potential and at a = 2/3: 
we obtain the Gaspar-Kohn-Sham potential. The “correct” value of œ 
lies somewhere in-between. 

We can now return from the model functions to the initial spin- 
orbitals. To do this we must set kp = kp (r), by analogy with the 
procedure customary in all statistical methods. Specifically, we 
express ky in terms of the average electron density (we denote it as 
Omoa) and replace it by the true one: 


kp = (817Z/Q5)'? = (30D moa)? = (311p (1). 


The resulting expression is known as Slater’s [96, 97] X, approx- 
imation for the exchange potential (the letter X indicates that the 
parameter œ is unknown), that is 


VE (r) = — Sy, | 3a? È Ys (0) VFO) |". (3.12): 


Starting with the nonlocal state-dependent exchange potential 
(3.2) we have thus ended up with a local potential (8.12), which is- 
the same for all the states. 

We have derived (3.42) very much on the basis of intuition and 
following, in fact, the original argument of Slater [93]. He performed 
averaging over the volume of the Fermi sphere, which corresponds: 
to a = 1 in (3.12). To justify this approach, a rather complicated. 
formalism was later developed [94, 95]. It turned out that the vari-- 
ational formalism [95] leads strictly to « = 2/3 and never toa = 1.. 
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‘This discrepancy turned out to be associated with the order of actions 
‘taken in the modeling process. 

Let us discuss this point in more detail. The Hartree-Fock equa- 
‘tion (3.1) is obtained by varying the total energy. The order of the 
‘calculations when deriving (3.9) was therefore first to take a vari- 
ation of the total energy, and then to apply the statistical approxi- 
mation (thereby averaging over all states). By contrast, Kohn and 
‘Sham [95] started by writing down the total energy in the statistical 
approximation and then proceeded variationally by constructing 
a one-electron equation with a potential averaged over all the states 
‘within the Fermi sphere. These two operations, namely the variation 
of the total energy and replacing the exchange potential by its sta- 
‘tistical approximation, are thus not “commutative” [87]. 

The Gaspar-Kohn-Sham approach is internally consistent and 
rigorous and is therefore being extensively used at present in various 
modifications of the Xa approximation. 

Naturally we want to know how the state-dependent Hartree-Fock 
‘potential of (3.2) can be replaced by the state-independent Xa poten- 
‘tial, for these potentials have long been known [98] to be very different 
from each other due to the large differences among the Hartree-Fock 
potentials that are defined for different states. The answer [87] 
‘is that a change in potential changes the wave functions and energy 
‘eigenvalues, but a Coulomb potential is not much affected by varia- 
tions in the wave functions because the average density (hence the 
‘Coulomb repulsion) remains virtually unaltered. For the exchange 
potential we have, approximately, 


[Vir (r) — Eža] Wee (nr) = [VIF (r) — EP] WEF (e) (3.43) 


which in fact answers our question. 

We have thus encountered once again the idea of a pseudopotential. 
In particular, we see again that a potential and a pseudopotential 
are not directly comparable; but their effects on the corresponding 
wave functions are comparable. 

It is interesting that whatever a the kinetic and potential energies 
in the X, approximation satisfy the virial theorem [87]. This is 
‘very important for we can thus be confident that the approximations 
made do not “violate the equilibrium” of the system. 

It should be remarked that the X, method has several serious 
disadvantages. To begin with, the application of the statistical 
approximation to the exchange potential violates the compensation 
between the Coulomb and exchange potential self-actions. Even the 
hydrogen atom (with a single electron) acquires an exchange po- 
tential! Certainly, the Coulomb potential for the electron “self- 
repulsion” is also considered, but the r-dependences of these potentials 
care different. Near the nucleus, the Coulomb repulsion always exceeds 
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the exchange X, attraction (for reasonable œ’s) while at large dis- 
tances the X, exchange potential falls off more slowly than the 
Hartree-Fock one. As a result, the large-distance potential due to 
an atom is not a Coulomb one. 

In the X, approximation for the total energy, the two self-action 
contributions cancel out, but the difference between the potentials 
affects the energy eigenvalues. In the hydrogen atom, for example, 
the X, approximation, even at a == 1, yields an energy of about 
0.6 Ry for the ground level instead of 1.0 Ry as it should. At lower 
a's the exchange attraction is weaker, the Coulomb repulsion being 
unchanged, which raises the level still higher. 

To improve the X, approximation, Latter [99] proposed to cut off 
the exchange potential at large distances while somewhat renormaliz- 
ing it at small distances. With Latter’s correction, the levels are 
markedly depressed (see below) even though they fail to reach the 
Hartree-Fock positions. Regrettably, the correction violates the 
virial theorem, i.e., it strongly affects the average shape of the wave 
functions. At present, Latter’s correction is considered unsatisfactory 
{100] for calculating the atomic wave functions to be used for con- 
structing molecular and crystal potentials. That the Latter correction 
affects the eigenvalues should be remembered when comparing a 
band structure with the corresponding atomic levels, because the 
correction changes both the absolute and relative values of the atomic 
energies (for example, the separation between the s and d levels in 
transition metal atoms, see below). 


3.1.4. Electronic levels in the Xa approximation. The literature 
provides tables of atomic wave functions, and potentials and energy 
levels for all the elements, calculated by the self-consistent X, 
method with the Slater exchange (a = 1) and Latter’s correction 
{101]. The computed one-electron energies are in reasonable agree- 
ment with the Hartree-Fock eigenvalues throughout the periodic 
table. As to the wave functions, however, it turns out [87] that the 
agreement is only achievable for a < 1. 

The implication is that the X, method is not self-consistent. It 
should be noted that even in the Hartree-Fock model the difference 
between energy eigenvalues is not identical to the energy of the 
transition between the levels. The more so in X, method where the 
eigenvalues obey the “pseudopotential” condition (3.13). But then 
the eigenvalues in the X, and Hartree-Fock methods have different 
meanings and, accordingly, cannot be compared. 

To see the physical meaning of X, electron energy levels, it should 
be remembered that we are dealing with the statistical approximation 
in which, unlike the Hartree-Fock scheme, the occupancy of the 
states is described by continuous variables. It follows that the one- 


electron energy of the ith level EX is determined as the deri vative 
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of the total energy Uža with respect to the occupancy of this level, n;: 





E a=- Uža, (3.14) 


G ôni x 


But this means that we come back to the notion of electronegativ- 
ity. To see this, note that, by definition, 


x - 

I; = Ue (ni, <s Rijs J — O54 (m, . oe, ni — Â, ...), 
x x 

S = UZS (My, oo ey METS, ©...) UGA (My, oy Mey ++ +) 


where 6 is unity in the Hartree-Fock method and may be small in 
the statistical approximation. We have 


1; 7S; =U (ry, oy nis 6.) UZE (M4, 0 Ri — s, aa). (3.15) 


Expanding the mght-hand side around 6 = 0 we see that the Xa 
eigenvalues may indced be interpreted as the electronegativities of 
given states: 


r 


Bra tS 6, (3.16) 


This means, in particular, that in numerical estimates, the Xa 
energies of the valence levels cannot be identified with the centers 
of gravity of bands in crystals, nor with the positions of the quasi- 
bound states. Even so correlations between these energies do exist 
and may be useful for qualitative estimates. 

In X, calculations of level-to-level transition energies one should 
take into account orbital relaxation. The needed formalism turns 
out to be quite simple. It may be shown [87] that the transition energy 
is determined very accurately if the electron is thought of as being 
“smeared” between the initial and final states. It is assumed that 
“half” the electron has already moved, as it were, to the jth level so 
that we have n; + 1/2 instead of n;, while the other remains on the 
ith level (n; — 1/2 instead of n;). This fractional occupancy of orbit- 
als is then used for a self-consistent energy-structure calculation for 
an atom or molecule. The difference A = E; — E; will give the 
transition energy between the ith and jth levels with a better accuracy 
than the analogous difference between the Hartree-Fock values. This 
approach is known as transition state theory [87]. The notion of 
a transition state is closely related to the interpretation of the ener- 
gies as orbital electronegativities, since it takes into account that 
all the levels are, on the one hand, lowered because the ith electrons 
weaken the screening of the nucleus and, on the other, raised because 
the jth electrons make it stronger (cf. Sec. 3.4.7; also see [102)). 

A self-consistent calculation using fractional occupancies might 
seem a painstaking task. Our experience shows, however, that even 
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with a relatively low-speed computer (such as an IBM-360/44, about 
80,000 operations per second), the calculation takes no more than 
5 to 7 minutes even for a transition element. 


3.1.5. Sensitivity to a. The values of œ are either determined from 
some optimization criteria or fitted to experiment. As yet, no semi- 
empirical calculations of this sort have been performed for atoms, 
for crystals there are a number of results. The determination of a 
by “optimization” may be carried out in a number of ways (see 
{103-107]). The most frequently used are œ = 2/3, œ = 1 and the 
values given in [105, 106]. It is interesting to note that, whichever 
‘way œ is chosen, its value decreases across the periodic table from 
about unity to about 2/3. 

Note that due to the way it was introduced the exchange potential 
(3.12) includes the exchange of both the valence and core electrons. 
Strictly speaking, the exchange interaction between valence elec- 
trons alone and that between valence and core electrons should be 
described by different a-values [108, 109]: As a rule, however, this 
is not the case. 

In the theory of solids, the subtleties of the X, approximation 
are usually given little or no attention. The choice of œ is mostly 
arbitrary, which however does not prevent investigators from drawing 
conclusions from band calculations about reality. This is especially 
unsatisfactory for the transition metals where the relative positions 
of the s and d levels are very sensitive to the details of the potential. 
A detailed analysis of the effect of œ on band-structure would be 
inappropriate here. Several calculations of this type have been 
made [110-124], but, as we shall see in Sec. 4.1, the dependence of 
band levels on potential is interwoven with their dependence on 
the k-coordinate for which the levels are computed. In a way, the 
band levels are “pushed away” from the nearest empty lattice levels, 
the positions of which vary from point to point in the Brillouin zone. 
The real sensitivity of an energy spectrum to the potential cannot 
therefore be easily estimated from these data. 

In order to separate out the effect of the potential, we should con- 
sider the œ dependence of the atomic levels rather than the band 
ones. The reader is reminded that even though the X, atomic levels 
EXxa do not coincide with the Hartree-Fock £7!", there should still, 
as (3.16) implies, be a correlation between them (in the Hartree- 
Fock method, by the Koopmans theorem, J; = EF), 

Since no such calculations have been reported, we carried out 
a number of self-consistent X -calculations for the Cr atom using 
different values of a, both with and without Latter’s correction. 
Chromium is a typical transition metal whose valence-shell configu- 
ration, 3d°4s1, is well established. The results of the calculation are 
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plotted in Fig. 3.1 where, for comparison, the corresponding Hartree- 
Fock levels [125] are also shown. It is worth discussing these curves 
in some detail. 

First, it will be realized that spin polarization is ignored, so the 
d level is not split into sublevels and is ten-fold degenerate. A com- 
parison of the s and d levels shows that both the Hartree-Fock and 
X ,, methods produce a “hole” in the spectrum: the s electron, instead. 
of occupying a vacant place on the lower d level, is promoted to the 







Without Latter correction 
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Fig. 3.1. Sensitivity of the 3d and 4s valency levels in Cr to the exchange para- 
meter œ. The Hartree-Fock energy levels are shown by the dash-dot lines. 


higher s level. The reason why this looks rather odd is that usually 
we deal with local potentials that do not need self-consistency (that. 
is, do not depend on their own wave functions). In such potentials, 
the one-electron levels are occupied in the order of increasing energy. 
It may be argued that a spectrum has no. holes if the potential is 
such that orbital relaxation may be neglected in electron transitions. 

If we assumed the Cr atom has a 3d° 4s° configuration thereby trans- 
ferring the s electron down to the lower d level, the Coulomb repulsion 
would raise this level and the total energy would increase instead 
of decrease. It thus turns out that, even though the vacant d level 
is below the s level, the s-to-d electron transition results in an excited 
rather than a ground state. The 3d®s° configuration is therefore an 
excitation configuration. 
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A transition from ad"~*s* to d"~**1s*— configuration thus corresponds. 
to an excitation of the atom. The reason for this is that as the number 
of inner-region (or d) electrons increases the nucleus becomes dra-: 
matically less attractive, the increase in the exchange potential: 
being insufficient to compensate for the Coulomb repulsion. In terms: 
of electrostatics, this means that the nucleus’s Coulomb potential: 
becomes more screened when the inner region contains more electrons. 
Since the attractive force of the potential decreases in this region, 
the orbitals become less tightly bound: they rise to become more: 
diffuse. Since the d orbitals are more sensitive to the inner-region: 
potential, they clearly will be shifted more markedly than the 
s orbitals. 

We conclude that, for an excitation configuration, the d levels: 
(in a crystal, d bands) will be shifted up relative to the s levels. 
(s bands). This effect has indeed been observed [126]. 

An analogous phenomenon takes place in other transition metals: 
their one-electron spectra contain holes. In a crystal, the relaxation: 
of the Bloch wave functions is negligible since there are very many 
electrons and the change in the state of one does not affect others. 
[87, 89]. Hence the filled part of a crystal spectrum (below the Fermi 
level) contains no holes. Sometimes vacant states above the Fermi: 
level are also referred to as band holes. 

Another interesting point is that the atomic levels differ in their 
sensitivity to the value of a, the d levels depending more on «æ than 
do the s levels. It is tempting (and customary) to ascribe this to 
some specific influence the exchange potential has on the d electrons, 
and having done so move on. 

Some insight into the nature of this influence (which undoubtedly 
exists because all magnetic phenomena are due to the exchange. 
interaction) is provided by Fig. 3.2 where the 3d- and 4s-electron 
densities are plotted for the vanadium atom [116]. The Wigner- 
Seitz sphere radius Ra andthe Bobr radius a, (alias the radius. 
of a hydrogen atom, 0.53 A) are also shown. It can be seen that the 
valence s electrons are mainly kept at the periphery of an atom (s 
region) while thed electrons occupy the inner (d) region. Even in an 
isolated atom, the d density at r = A, is comparable with the s den-- 
sity. For comparison, we also show self-consistent computed partial 
densities for a crystal [116]. It may be seen that in this case s and d 
densities are displaced outwards. Moreover, there is some p density 
in a crystal in contrast to an atom where there is none. The reason 
for this is that the p phase shifts are nonzero (cf. Fig. 2.3). 

The 4s-electron density is highest in the outer regions of an atom, 
while that of the 3d electrons is highest in the interior. It should be 
noted that this correlates with the difference between the pseudopo- 
tentials of the 4s and 3d electrons. 
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To see this, recall that a chromium atom has core s states (which 
follows, by the way, from th eoccurrence of oscillations in o,,). For 
the 4s electrons we can introduce a pseudopotential by “cutting off” 
the core region of the potential. They will then be acted on only by 
the outer region of the atom’s potnetial, that is, the behavior of 4s 
electrons is again detormined by the details of the outer region of 
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Fig. 3.2. Self-consistent partial electron densities in vanadium for a free atom 
(bold lines) and crystal (dashed lines). The data are from [116]. 


the atom. In fact, we came to this conclusion in the framework of 
scattering theory, which led us in Sec. 2.2 to the concept of pseudo- 
potential. 

There are no core states for 3d electrons and hence no pseudopo- 
tential can be introduced. These electrons are acted upon by the 
whole of the atomic potential. Since the main feature of this potential 
is its Coulomb singularity at r — 0, it follows that for the d electrons 
it is the inner part of the atom which is important. This conclusion 
is based, once again, only on the concepts of pseudopotential theory. 

Let us now return to the X, exchange potential to see why it is 
d electrons which experience the specific influence of the exchange 
potential. The exchange potential (3.12) is strong where the electron 
density is large. The larger the electron density, the stronger is the 
exchange-potential attraction. Hence the density is pulled into the 
high-potential region and this, in turn, causes a further increase in 
the exchange potential. This density self-contraction continues until 
the attraction is compensated for by Coulomb repulsion. 

A change in a most strongly affects the attraction in the high- 
density (i.e., inner) region and only slightly affects the attraction 
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in the low-density (outer) region. Therefore, as œ increases, the 
d functions are pulled in more strongly than are the s functions, 
which results in the d level being more depressed than the s level. 
This is what we see in Fig. 3.1. 

Since in the X, method the Coulomb and exchange self-action 
terms do not cancel, it follows that the density contraction in the 
X a method must be stronger than it is using Hartree-Fock approach. 
As a result, the X, atomic wave functions must be more “shrunk” 
than the Hartree-Fock ones or, in other words, the electrons in the 
X method are more localized. This should be borne in mind when 
constructing a crystal potential from atomic X, functions: they must 
be made somewhat “blurred”. This may be accomplished (see below) 
by considering an excitation configuration and thus including more 
diffuse wave functions (it is not necessary to transfer the whole of the 
s electron to the d level). 

Concerning this shrinking of X , functions, it should be remembered 
that according to the calculations in [87] the agreement between 
the X, and Hartree-Fock functions is better for a < 1, i.e., with 
smaller X, exchange potentials. 

Increasing « makes the outer region less important (because of the 
density self-contraction) and thereby reduces the effect of Latter’s 
correction (see Fig. 3.1). 

Figure 3.1 shows the results up to a = 1.3. It may be seen that 
for this a, the Xa levels (without Latter’s correction) are in good 
agreement with the Hartree-Fock levels. The reason for this partic- 
ular choice is that at « = 1.3 the exchange potential (3.8) corre- 
sponds to the bottom of the free-electron band, i.e., at k = 0. 

To see this, note that by (3.7) 


F(k=0)=1, 
Rp 


| PF) Mdk=F, F(k=0)=5 Po, 


> 4n 
Fsi = Taji} 
0 


that is, the exchange at the bottom of the band is 4/3 times the 
Slater exchange. By the definition of the X, approximation, Eq. 
(3.11), this leads to a new value G@p=»: 


Qk=0 = 4/3. 

It will be remembered that it was, in fact, the same argument 
that led us to the quantity a,—,, for the Gaspar-Kohn-Sham ex- 
change: _ B 

F (k= kp) = 1/2, Fy = 3/4, F (k= kp) = (2/3) Fsi, 
that is, the Fermi-level exchange is weaker than the Slater’s one by 
a factor of 2/3: 
Ah=khp = 2/3. 
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The question of why the X, levels coincide with the Hartree-Fock 
ones at Gp» is beyond the scope of this section. We note, however, 
that the condition k = 0, which corresponds to a band bottom in the 
Statistical approximation, corresponds thereby to the lowest and 
most favorable level. 


3.1.6. The positions of the levels across a d period. According to 
(3.12), an increase in a is equivalent, in a way, to an increase in the 
electron density, which is pre- 

E,Ry dominantly due to d electrons. 
Figure 3.1 may therefore be con- 


N sidered as a model for describ- 
ing how the valence levels move 

“AA from one element to another 
across the periodic table. As the 

atomic number increases, so does 


the attractive force of the atom’s 

nucleus. This is not compensated 
\ for by the increasing Coulomb 
repulsion of the valence electrons 
and so the total attraction in- 
creases. As aresult,the orbitals are, 
so to speak, pulled into the inte- 
rior of the atom, the d orbitals 
relatively more so because the s 
electrons are screened from the 
nuclear charge by the increasing 
number of d electrons “below” 
them. 

Figure 3.1 thus illustrates the 
difference in the sensitivity of s 
and d levels to the change in 
the atomic number. 

Figure 3.3 shows s and d levels 
computed using the self-consist- 
ent X, method [98] for a = 1 and 
Fig. 3.3. 3d and 4s free-atom levels lLatter’s correction (the Hartree- 
along the d period, computed for Fock levels vary across a period 
x= 1 using moon correction in very much the same fashion). 

` It should be noted that in all 

elements a vacant d level is below 

an occupied s level, that is, there is no level inversion as in Fig. 3.1. 
It is clear that the s and d electrons differ in the way they depend 
on the atomic number, as we “predicted” on general grounds they 
should. Towards the end of a period, the d level is virtually inside 
the ion core. In this case, the energy gap between the s and d levels 
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is so wide that having accepted an extra electron from the s shell, 
the d level cannot rise high enough to make the transfer energetically 
unfavorable. It is for this reason that at the end of a period the atomic 
d holes become occupied. 

For all the elements except chromium and copper, the configura- 
tion is assumed to be d"~*s?, n being the number of the series. With 
chromium and copper (the configuration is d’~'s1), we can see how 
the occupation of one d hole affects the position of the levels: as the 
nucleus is screened more, its attraction reduces, and the levels rise. 

To summarize, although our topic in this section was mainly the 
X , approximation, all our conclusions as to the effect of the exchange 
interaction on the energy structure of atoms (and crystals) are inde- 
pendent of the approximation. The same is true of the sensitivity 
of s and d electrons to various regions in the potential. It may be said 
a priori, for example, that on forming a crystal it is the s electron 
wave functions that will change most because clearly the atomic 
potential will be altered most in the outer region, At the same time, 
the d-electron wave functions will remain very atom-like. Hence, 
even though the d electrons will be disturbed in the crystal formation 
process, they cannot be put on the same footing as the s electrons. 
In going from 3d to 4d and further to 5d metals, the core will acquire 
bound d states which will make the d orbitals more diffuse and less 
dependent on the inner region, i.e., they will become more s-like 
and hence a d pseudopotential will become possible. It then follows 
that a pseudopotential theory with s and d electrons treated equally 
would be more suitable for the heavy d metals (note, though, that 
relativistic corrections may be important there). 

It should be remarked that attempts to improve further the X, 
method continue [127-137]. 


3.2. Dielectric screening 


We shall in the next few sections discuss the modification of the 
atomic potential due to the formation of a crystal. 


3.2.1. The concept of a pseudoatom. We saw in Sec. 2.2 that wher 
a scatterer (atom or ion) is placed in a free-electron gas, the electron 
density around it is changed. The total displaced charge is equal to 
the charge of the scatterer Z which is to say that the introduced 
potential is screened by the electron gas. 

In a real crystal, the electrons screening a potential are not free 
ones. In the pseudopotential theory we have to ignore this difference 
because otherwise the problem of constructing a screened pseudopo- 
tential would be too hard to solve. In band-structure calculations 
using secular equations, the (crystal) potential-constructing pro- 
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cedure takes into account the atom-like nature of the wave functions 
of the screening electrons (see Sec. 3.4). In both models the original 
potential (placed in the crystal) and the charge that screens it may be 
considered as a single whole, i.e., as a solid-state analog of the atomic 
potential. This entity (the original potential plus the screening 
charge) is called a pseudoatom*, whether the potential being screened 
is a true or a pseudopotential. 

A pseudoatom differs from a real one, first of all, in its large- 
distance behavior. In a true atom the density decreases exponential- 
‘ly with r, the exponent depending solely on the atomic energy 
levels. In a pseudoatom the density falls off like r~? cos (2kpr) (see 
Sec. 2.2), which is much more slowly and depends, through kp, 
on the characteristics of the crystal. 

Let then Vi™ be the original potential (say, that of a certain ion) 
and Vt the screening potential of the electrons. The crystal poten- 
tial Ve" (the one due to the pseudoatom) is the sum of these: 


< yo — yio 4 ye (3. 17) 


This total potential acts on the electrons and in turn requires screen- 
ing. The resultant new potential also requires screening, and so on 
ad infinitum. It will be understood, however, that at each successive 
“iteration” the screening has a progressively smaller effect on the 
potential until the potential becomes self-consistent (i.e., such that, 
were it introduced as the original potential, the crystal would have 
no effect on it). 


3.2.2. Dielectric operator. We want to know whether a self-consist- 
ent crystal potential can be found in a general form without invoking 
a painstaking iterative process. 

We need to express V**" in terms of the final result of screening 
ver. We denote the density of the screening electrons as p**' and, 
for simplicity, we will use operator notation. 

At every iteration 


VT Ao, (3.18) 


where A is an operator. This equation expresses the obvious fact 
that a screening potential arises from the nonuniformity of the elec- 
tron-gas density. On the other hand, this departure from the mean 
is caused by the total crystal potential V™. In a general form thir 


* It would be more proper to call it a “quasi-atom” because first there would 
be no unnecessary associations with the pseudopotential, and second an indivi- 
dual atom within a solid is‘not a well-defined notion. “Pseudoatom”, however, 
is}the generally accepted term. 


Ch. 3. Theory of potential 89. 


may .be. written 


pst = BY™ (3.19) 


where the operator B is to be found later. 
The screening potential can thus be expressed in terms of the 
result of its action: ` 


vst =. ABV", (3.20) 


We are now in a position to subject the solution to a selfi-con- 
sistency condition (it should be emphasized that thus far this has 
not been done). We suppose, namely, that the iterative process has 
been carried through to self-consistency, i.e., V™ (that produces 
the screening charge) coincides with the crystal potential given by 
the sum of the original potential and the corresponding screening 
potential. We may then consider that V“ in (3.17) coincides with 
its counterpart in (3.20). Finally, from (3.17)-(8.20) we get 


vt = V4. ABV, (3.24) 


Note that the crystal potential is thus recursively defined. Another 
option we might choose for constructing Vë is to employ a model. 
We shall meet this approach in Sec. 3.3 and we shall see there that 
it is more versatile than the present scheme because the mathematical 
complexities of the self-consistent screening method make simpli- 
fications unavoidable. 

Let us introduce the operators 


e=1—AB and e1=(t—ABy*, (3.22) 
Obviously, 


ever — yion and vr = e-tyion . (3.23) 


The operator e is called the dielectric operator and equations 
(3.17)-(3.23) form a basis for what is known as the dielectric form- 
alism. They are applicable to both potentials and pseudopotentials 
since there are no restrictions on the strength of the original potential 
Vion , However, the computational difficulties encountered in dealing 


with g necessitate small-parameter expansions and, accordingly, 
the use of a pseudopotential. The dielectric potential formalism is 
only used in pseudopotential theory while in band-structure cal- 
culations* other methods for constructing crystal potentials are 
employed. We compare these approaches in Sec. 3.4. 


* Based on methods stemming from scattering theory such as the APW and 
KKR approaches. 
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3.2.3. The dielectric function € (q). Let us now discuss some con- 
crete approximations. We shall use the matrix representation of 
operators. The matrices of the operators Vi" and V% are formed 
from the potential formfactors. From (3.22) we get 


X k +a lela’) (q IVk) = (k + aly’ Ik). 
g 


In the general case the matrix e is nondiagonal. To set up a self- 
consistent potential V°T, we must find the inverse dielectric matrix, 
which is a rather difficult problem because as we shall see later the 
operator B is nonlinear in V. At present, the matrix e is inverted 
approximately [138-140]. 

In pseudopotential theory the simple assumption that the matrix e 
is diagonal is made. If nondiagonal -elements are in some way taken 
into account, one speaks of “nondiagonal” screening. With diagonal 
screening, then, we get the dielectric function e (k, q): 


(kt qlelq’) =e(k, 4) kta, g- (3.24) 


From (3.23) and (3.24) we now obtain the basic formula in the theory 
of dielectric screening: 


cr -l zion k - 
kalv = Sa (3.25) 


Conceptually, this formula is close to those used in electrosta- 
tics where to determine a field in a medium its “vacuum” value 
is to be divided by the dielectric permittivity of the medium. The 
dielectric function e (q = 0) is in fact the dielectric permittivity. 
For metals, e (q = 0) = œ which means that external fields are 
completely screened. 

We now turn to the evaluation of the matrices A and B. In differ- 
ential form (and the CGS unit system) the Poisson equation (3.18) 
has the form 


VV (r) = — 4ne% (r). (3.26) 


Fourier-analyzing the potential and density, we have 





ser 4s 2 ‘apr 
ve (q) =r 08 (q), (3.27) 
Ale! Ane* 
(k-+q Ala’) =F Sicha, at (3.28) 


To evaluate B, p°" should be determined in terms of the pertur- 
bation potential V°. To do this, we write down the first-order wave 


function YK, construct the crystal density (with spin taken into 
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account): 
m= 2 2 EOP (3.29) 
and subtract the original density from pt to obtain pscr, 
Thus, 
kigq|Ver|k 
Yk (r)= 1k) — Dy es BR, Ik +q), 
q0 q 
` ({k+a verik) ; 
et (r) = p? (r)— (21) | ` [or e qr (3.30) 
ASh Ep qæ0 
_(k [| Ver | k+ , 
Di eiar | dk. 


In fact we derived this expression in the section on scattering 
theory (Sec. 2.4). It can be obtained by combining Eq. (2.112) 
for the wave function with the Born approximation (2.120). The 
q sums in (3.30) will then arise from using Green’s function (2.122) 
rather than from first-order perturbation theory. 

Bearing (3.25) in mind, we should go over to reciprocal space. 
The sums over q will then vanish: 





2 | (k-+q | Ver | k) 
(ny eg—-k 
RSR p 


PT (q) = po (4) Bk 





2 | (k | Ver | k— q) Bk. 


— (27)3 e„— k? 

h<hp q 
The “summation” in the integral is over all k; the third term remains 
unchanged on changing the sign of k and becomes Hermitian-con- 
jugate of the second term. For Hermitian potentials these terms will 
be equal to each other giving 


4 kq] |k 
set (q) = — as f Akg | Ver | k) dk. (3.31) 


€ — k2? 
hc<h q 





F 


We have thus succeeded in expressing p" in the desired form 
given by (3.19). If the formfactor depends on k, the integration can 
actually be performed only if the dependence is known. Note that 


not only is B nonlinear in Ve (due to perturbation theory) but, for 
nonlocal potentials, it is also an integral operator. 
For a local potential, (3.31) simplifies since (k -+q | VeT|k) = 


Ver(q) and Ve (q) may be taken out of the k-integral. We shall 
then be left with the Lindhard function we met in the Introduction, 
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and Eq. (3.31) takes the form 





px (q)= 4 V% (q) x(q). (3.32) 
Finally, we get 
e()=1- 31 (ae) » (3.33) 
where 
4(2)= — (+ n|] (3.34) 








The function (3.33) is called Lindhard’s dielectric permittivity 
(141, 3] although it was Bardeen who first derived it [142]. It is 
also referred to as the Hartree function because the underlying 
assumption in constructing it is that the emergence of a scattering 
density results in a purely Coulomb potential Vt, What actu- 
ally happens (and must be taken into account) is that a change in 
the electron density alters the exchange potential of the electron 
gas. The change in the correlation potential should also be allowed 
for. 

The relevant theory, however, is beyond the scope of this book 
and we shall only reproduce its main results (for details, the reader 
is referred to [73, 17, 143, 144]). 

The inclusion of an exchange-correlation potential leads to re- 
placement of A (q) by A (q) p (q) with p (q) depending on the partic- 
ular model of the potential (the corresponding dielectric function 
is designated by an asterisk, that is, Eq. (3.25) involves e*) and 
we get 


e* (q)= 1 - “SS p(a)x (ge). (3.35) 





It is customary to use p (q) in the form suggested by Hubbard 
(445, 146], although quite a number of other formulas are also avail- 
able [147-158] (for a brief review, see [17, 159]). A recent analysis 
[458] indicated that the treatment of the correlation in [150] is very 
valuable. 

There are only few investigations [160-166] about how the com- 
puted properties of crystals depend on the specific model of many- 
electron effects. 

Note that since Ver (q) = Vi (q)/e* (q), the function a (q) 
defined in (1.39) by the ratio Vi" (q)/V¢ (q), in dielectric formalism 
becomes g* (q). Were the screening introduced in another way, 
x (q) would again enter Eqs. (1.39)-(1.41), but e* (q) would not. 
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3.2.4. The q dependence of the dielectric function. Let us now 
investigate how e depends on g. For small q, 


1 


x(Q)lor0= —(— EF), (3.36) 


and e(q) diverges as qq. 

At q = 2k, y is finite, but dy/dq has a logarithmic singularity. 
Although this latter is very weak and hardly even visible on an e (q) 
plot, it is nevertheless essential for some crystal properties, such as. 
the Kohn singularities in phonon spectra ([3, 73, 89]). 

AS q—> œ 


y% (a) |qro > — 2/8, (3.37): 


and æ (q) tends to unity, i.e., at large q (q >> 2kp) there is no differ- 
ence between Ve (q) and Vi" (q). 

We see that g (q) is a monotonic function which does not change- 
sign. Screening is important at small q. After a Fourier transforma- 
tion to real space, small q corresponds to large r, i.e:, screening 
removes the long-range nature of the Coulomb potential. 

If a purely Coulomb potential (1.26) is placed in a free-electron. 
gas, screening theory gives 


AnZe® 
(k+ q| V“ |k) = -oe AOT , (3.38) 


where (3.33) and (3.34) have been used and 
a? (q) = —4nZe?y (g)/Q > 0. (3.39): 


Equation (3.38) is simply a generalization of (1.25). If a is inde- 
pendent of g, we get the Thomas-Fermi approximation and the 
screened Coulomb potential is described by (1.24), i.e., it no longer 
is a long-range one of the type 4/r. 

The logarithmic singularity in e (q) gives rise to (spatial) oscilla- 
tions of the screening charge. This may be seen by taking the Fourier 
transform of (38.25): 


Py) (k-+q lVi” 1k) a-ig 73 | 
ve (r)=2 | +o e-iar dq, (3.40) 
Changing to spherical coordinates and integrating twice by parts: 
[73, 89] yields second derivatives of e (q) in (3.40). All the other 
functions depend on g only weakly and can be taken out of the inte-- 
gral. At the limit of r co we find 


r 32e ViOn (q= 2kp) cos (2k pr) 
(> Gp Dis) kr (3.41) 





that is, the potential and 4 charge density behave, in fact, in the 
same way they do in scattering theory (Sec. 2.2, Eq. (2.94)). 
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The oscillations have the same origin in both approaches, viz. 
‘the discontinuity in the electron energy distribution at the Fermi 
‘level. In scattering theory, this discontinuity appears in a direct way, 
‘whereas with dielectric formalism, it comes about as a singularity 
in e (q) at q = 2kp, which then manifestsitself in the Fourier trans- 
form of (3.40). 

It would be wrong to argue that at large distances from an ion, 
only oscillations of the type (3.41) will be present. It may be shown 
{167] that a rapid decrease of e (q) with q (even with the q = 2kp 
singularity ignored) also leads to oscillations in Ve with r, but 
these will be modulated by an exponential rather than by r-3, that 
is, the use of (3.37) gives rise to oscillations in the tail of the Yukawa 
potential (1.24). 

If the potential V° has a resonance state with an energy Ea, 
‘the electron energy distribution drastically changes in this vicinity. 
In real space, we should expect then that the charge density and 
Ver have extra oscillations, similar to those in (3.41), but with 


Ap replaced by V Ey. This result is due to Rennert [168]. 

It follows from Eqs. (3.41) and (1.41) that the indirect interatomic 
interaction potential @,, (¢) has Friedel oscillations. If there are 
‘resonance states, Rennert oscillations will also be associated with 
this potential. When calculating the properties of crystals, Rennert 
oscillations should be taken into account in constructing the pseudo- 
potential because they do not arise in a standard derivation of e* (q). 

Rennert oscillations can be incorporated in a model approach by 
adding to e* (q) a term having a logarithmic singularity at q = 
2VEq, Ea being a d-resonance energy. 

The same result may be obtained more rigorously by evaluating 
the dielectric function e* (q) as a matrix element [143] of the “sys- 
tem’s response to a perturbation” between wave functions that incor- 
porate d-electron effects. Such a procedure was suggested in [169, 
170] with wave functions taken from unhybridized s bands (i.e., 
from the NFE model) and d bands (i.e., from the LCAO model). 
As a result, in addition to the ss and dd interaction terms, an sd 
‘contribution to e* (q) appeared and the d electrons in narrow d bands 
may be strongly localized, their:response to a perturbation weak 
and the dd interaction in e* negligible. We are thus left with two 
terms in s*, one corresponding to the usual s-electron screening and 
the other to the screening by d electrons. The d-screening has a loga- 
rithmic singularity at q = 2k, and leads therefore to Rennert’s 
oscillations. 

This formalism was developed in [171-176] and it turned out that 
in the noble metals, the off-diagonal correction terms in e,q are small 
as compared with the diagonal terms [172]. This is very important 
since there is no a priori reason to believe that the diagonal approxi- 
mation is applicable to the transition metals. 
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The role of screening is extremely important. A pure unscreened 
Coulomb potential always produces bound states and the perturba- 
tion expansion will therefore be always divergent. This is easily 
shown for a pseudopotential of the type (2.100) (with V oc —r~?) 
by using the Bargmann criterion (2.109). Furthermore, a pseudo- 
potential with a long-range Coulomb tail cannot, strictly speaking, 
be factored as shown in (1.24). The reason is that the sum of these 
contributions diverges and termwise integration (only possible for 
absolutely convergent series) becomes illegitimate. This difficulty 
was pointed out in [177]. These singularities may be avoided by 
assuming some “background” screening leading to a short-range po- 
tential [177-182]. 

The formulas for linear dielectric screening are thus only true for 
relatively weak pseudopotentials which produce small perturbations. 

As a measure of the disturbance caused by a pseudopotential, 
we can take the single-ion screened pseudopotential averaged over 
the cell volume Q,. It is clear from Eq. (1.23) that this is simply the 
long-wavelength limit of the formfactor. To evaluate this limit, it 
suffices to let g go to zero in Eq. (3.25). Using Eqs. (3.33) and (3.36) 
we find for the pseudopotential of a screened ion We that 





lim (k + q|W"[k) = —= E}. (3.42) 
g>0 


Here care has been taken of the fact that since the ion pseudopo- 
tential Wio (r) behaves at r— œ like ~Ze?/r, its formfactor 
Wien (q) at small g is —4mZe?/(Q 99") (cf. (1.26)). 

We thus see that whatever the initial ion pseudopotential the 
crystal pseudopotential as given by linear screening theory (LST) 
will have the same mean value, namely, —(2/3)E$. The properties 
of an ion pseudopotential are thus taken into account in the screen- 
ing process and its “identity” is more or less lost. This is the reason 
why many pseudopotentials, each leading to the same result, are 
possible. 


3.2.5. Possible pseudoatom potentials. Let us see, qualitatively, 
which screened pseudopotentials correspond to which ion pseudo- 
potentials. To do this, we must go over from the formfactor of the 
crystal potential (3.25) to coordinate space. The explicit form of 
g (q), (3.33) is too complicated to use in analytical calculations and 
so it must be replaced by model functions. 

It will be realized that what we are now concerned with is not the 
behavior of the potential at infinity but rather at intermediate 
distances where Friedel oscillations do not yet appear. We can there- 
fore take a model function e (q) without a logarithmic singularity. 
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Let it be the Thomas-Fermi approximation (cf. (3.38)): 
3 4nZe? 


The factor 3/2 in the second term is introduced following [183, 184} 
to secure the correct long-wavelength limit (3.42). 

We.may choose the form of (2.100) suggested by scattering theory 
as the ion pseudopotential. This is a square well of depth A (i.e., 
V = —A) at r< R and a Coulomb attraction potential (V = 
—Ze?/r) at rœ R, R being a model radius. Two characteristic 
values for the well depth are A = 0 and A = Ze?/R (when there is 
no potential jump at r = R). 

A few words about terminology. are also appropriate. The model 
pseudopotential we are dealing with is known in the literature as 
the Heine-Abarenkov potential [3, 13, 17]. Its special cases are the 
Ashcroft (A = 0) and Shaw (A = Ze*/R) pscudopotentials. These 
potentials are local, but there are many nonlocal potentials with A 
and R depending on the orbital quantum number /. 

We return now to constructing a pseudoatom potential (recall 
that the pseudoatom is a combination of the original and screening 
potentials). Our aim is to see how the shape of a pseudoatom po- 
tential depends on the assumed well depth. With this knowledge 
we shall be able to analyze all the possible forms of pseudoatom that. 
can arise in dielectric screening theory. 

First we evaluate the formfactor of our model pseudopotential 


win Q=— q (Ze*— AR) os gR+AsingR (3.44) 


Substituting (3.44) and (3.43) into (3.25) and passing to coordinate 
space, we obtain the screened pseudopotential W (which is a 
crystal single-site potential alias a pseudoatom potential): 





cr r [ [A(1+b)— As] e sinh (ar), r<R, 
W` (r) = —const — . “or 
[As cosh d+ A (b sinh d— cosh d) e, r>R, 
(3.45) 
where 
6nZe? — S S 
a= ET d=aR, b= 7? (3.46) 
and 
Ze? 


is the well depth for the Shaw pseudopotential, 
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Table 3.1. Ion and_pseudoatom potentials as functions of the model depth 


No. Unsereened Screened 
pseudopotential pseudopotential 
(ion) (pseudoatom) 
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It can easily be seen that W% (r), taken as a function of A, in- 
volves two more characteristic values of well depth: 


Aga = Ag(1+b)", (3.48) 
Aya = 4g (1—b tanh d)“. (3.49) 


(i 


| 





A 


If A = Ag, neither the pseudoatom nor the initial Shaw potential 
have jumps at r=R (Table 3.1, No. 4). With A = Apa, W“ (r) is 
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identically zero (see No. 2). By analogy with the term “empty core 
potential” sometimes applied to the Ashcroft potential, we have 
introduced here the “empty atom (EA) potential”. Note the similarity 
between the pseudoatom with Axa and the pseudoatom of the phase 
function method, cf. (2.99). 

If A = Axa, the function W (r) is zero for rœ R, i.e., the 
ion potential is completely screened (see Table 3.1, No. 6). Outer 
electrons are neither attracted nor repelled by this pseudopotential 
for electroneutrality is already achieved within the sphere of radi- 
us R. This case we denote as the “neutral atom” (NA). 

As A goes from minus infinity to plus infinity, the square well 
changes from being infinitely repulsive to infinilely attractive and 
the pseudoatom changes its form as shown in Table 3.1. If A < Aggy, 
then for r< R the pseudoatom has a repulsive core, the size of 
which gradually increases in magnitude with A (Table 3.1, No. 1). 
For A > Ax, the repulsion arises in the outside region and in- 
creases with A (No. 7). 

lt should be pointed out that although there are no Friedel oscil- 
lations in our model, actual (not Thomas-Fermi) screening does 
produce such oscillations in the “tail” of every pseudoatom. 

The above qualitative argument (due to Shaw [184]) is supported 
by some quantitative data. For example, Meyer and Young [185] 
did a direct numerical calculation of an Ashcroft pseudopotential 
based on Thomas-Fermi screening. In the same article, the pseudo- 
potential was screened with a dielectric function taken in the Har- 
tree approximation and then the corresponding pseudoatom potential 
was calculated. The results agreed both with each other and with 
our qualitative discussion (Table 3.1). Rasolt et al. [186] numerically 
computed Hartree-screened local pseudopotentials for two values. 
of A; the form of the resulting pseudoatom is in accordance with 
Table 3.1 (Nos. 3, 5). It is worth mentioning that these calculations. 
exhibited no Friedel oscillations in the pseudoatom because the 
distances considered were too short. 

We thus have used the Thomas-Fermi model for the dielectric 
screening of a model pseudopotential. This is not the only applica- 
tion the model has in solid-state theory. When applied to a crystal, 
the Thomas-Fermi method in its classical formulation* fails to 
minimize the total energy with respect to the lattice parameter. 
The situation, however, is saved by including exchange effects 
[187, 188]. This scheme is known as the Thomas-Fermi-Dirac-Wei- 
zsacker (TFDW) method and has been used in a number of crystal 
studies. 


* The term “Thomas-Fermi method” is generally used in the context of solv- 
ing the Thomas-Fermi equation. In Eqs. (3.43)-(3.49) a different approach is 
taken. 
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In [489], for example, the TEDW method was combined with 
pseudopotential theory and the Wigner-Seitz radii for the first 
55 elements of the Periodic Table were calculated. In the majority 
of cases, the agreement with experiment was within 5%, even for 
the transition metals. When applied to nonmetals (such as Si and 
Ge) the method led to large errors, as might be expected. On the: 
whole, however, the agreement was quite satisfactory. The method 
was later applied to alloys [190, 194] and semiconductors [192]. 
The Thomas-Fermi method thus turns out to be a rather reliable- 
tool for calculating pseudoatom potentials. 


3.2.6. Interatomic interaction potential. To demonstrate the: 
applieations of dielectric screening theory further, we employ the 
Thomas-Fermi method for an analysis of the interatomic interaction 
potential (1.43) in the framework of the pseudopotential theory. We 
consider the simplest pseudopotential possible, namely the Coulomb. 
potential (—Ze?/r) with a formfactor (1.26) (this is in fact a Heine- 
Abarenkov type potential with R = 0). To incorporate screening, 
we use the Thomas-Fermi model (3.43). Some precautions should be: 
taken though because, as remarked earlier, this model requires the: 
inclusion of an exchange potential. Moreover, the function e (q) 
should be assumed to fall off faster than q~*. Let us introduce into: 
(3.43) an “average” correction due to purely Coulomb effects C and 
one for purely exchange-correlation effects X. We denote p? = a*C® 
and g? = a°X?. The dielectric function e (q) without exchange is 
e = 1 + p’/qg? and with both exchange and correlation effects it is 
e* = 1 + g/g. The function y (q) is simply —Qa?/8e?; the quantity 
a? is given by Eq. (3.46). 

The interatomic interaction potential has the form of (41.43) [47]: 

Z2e? 22 f ; 
(7) = + ae | LW (01 x(g) e (g) car ag. 
For the pscudopotential and screening chosen, the integral can be 
evaluated analytically, giving 


D (r) = [X21 bee. (3.50) 








This potential has no oscillations since we have used the Thomas- 
Fermi function rather than Lindhard’s. 

Our next concern is to find out under what circumstances Ọ (r) 
may have a minimum corresponding to the equilibrium interatomic 
separation in a crystal. The condition for the existence of a minimum 
is obtained by differentiating (3.50) with respect to r: 


(1+ gr)exp(— gr) =1—X?. (3.51) 


The left-hand side of (3.51) is a positive function smoothly decreas- 
ing with r; at r = O its value is unity. Hence, whether or not this. 
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‘crystal model is capable of giving an equilibrium separation depends 
on the right-hand side of Eq. (3.51). 

The quantity X? is, by the way it was introduced, an average of 
the function p (q) involved in (3.35). Whatever model is assumed, 
this function monotonically decreases from 1 to 0.5 or even lower 
{17]. We may therefore assume that 


Xe<t. (3.52) 


Consequently, if the exchange correction is included, the right- 
hand side of (8.51) is always less than unity and (3.51) is always 
solvable. If exchange effects are ignored, i.e., X = 1, there is no 
attraction between the atoms and no crystal forms, as is the case 
in the ordinary Thomas-Fermi model or inthe Hartree model 
[193]. 

Interestingly, even as simple a model as this makes it possible 
to characterize each element by its own “effective exchange”. Further- 
more, there may be then a quantitative agreement with experiment 
for each element, namely, the distance obtained by (3.51) coin- 
cides with the radius of the crystal first coordination sphere. 

Taking into account that 1 — X?> 0, Eq. (3.50) may be re- 
written as 

@(r) = B Aen y 


r r 


(3.53) 


with A and B positive. In this form, the interatomic interaction 
potential corresponds to the Born-Mayer potential, which is fre- 
quently employed in interpreting interatomic forces. In a sense, 
then, we have justified the Born-Mayer potential in terms of pseudo- 
potential theory. 

Note that the coefficients A and B in (3.53) have very much the 
same meaning they have in the theory of Born-Mayer potentials. 
A feature common to both approaches is that both the repulsion and 
the attraction are in (3.53) due to electron-electron interaction. 
There is a major difference, though. In the Born-Mayer theory repul- 
sion at small distances is due to exchange interactions arising be- 
tween atomic cores brought too close together. In pseudopotential 
theory, we see that this repulsion arises instead from the Coulomb 
term while the exchange is responsible for the stability of the crystal. 
If exchange is not allowed, no minimum will appear. 

Certainly the above argument on the role of exchange in the sta- 
bility of crystal structures can only be given a qualitative signific- 
ance. It does indicate, however, that the exchange-correlation inter- 
action may be very important in crystal studies. 


3.2.7. The screening potential and the convergence of series. It is 
appropriate here to raise an interesting point. We have used the 
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Coulomb potential, whose feature is the presence of bound states. 
If the potential is screened, bound states may disappear because of 
the additional Coulomb repulsion produced by the screening. The 
screened potential (pseudoatom) has the form 


We (r)= —2 





exp (— gr). 


The Bargmann integral (2.109) for the potential is 


2 702 
B,= Ze Ze 


“aE 

A screened crystal potential will have no bound states if B, < 1, 
i.e., X? > Ze?/a*. To estimate X*, we evaluate the quantity Ze?/a* 
making use of (3.46). We then find that for copper, for example, one 
should take X? > 1.53, for caesium X? > 3.04, for molybdenum 
X? > 5.53, all the figures disagreeing with condition (3.52). 

It thus turns out that the screened pseudopotential we used has 
a bound state. But then this is a paradox for, on the one hand, this 
pseudopotential allowed some reasonable conclusions about the 
stability of crystals and, on the other, as proved in Sec. 2.4, a per- 
turbation series diverges for a potential having bound states. A pos- 
sible answer is that the divergence only becomes critical when there 
are an infinite number of terms in the perturbation expansion. In 
other words, the divergence of an infinite perturbation series does 
not necessarily exclude the calculation of crystal properties because, 
in practice, the series is always truncated. (In Sec. 4.2.2, we shall 
discuss the use of the secular equation in OPW band-structure cal- 
culations. Strictly speaking, this equation is not applicable if the 
wave function is expanded over an infinite basis set. Nevertheless, 
the OPW method is workable, though not very accurate.) 

Another possible answer is as follows. We saw in Sec. 2.4. that. 
a perturbation theory series diverges for any isolated potential 
screened by the dielectric formalism. This being the way in which 
all pseudopotentials are screened, must we reject pseudopotential 
theory as a whole, ignoring thereby all its successes? Certainly not. 
The paradox is resolved, strange as it may seem, by the fact that 
the crystal actually contains an infinite number of such “bad” po- 
tentials. Their combined effect is to lower the total potential; the 
bound states then fall into the conduction band and are “destroyed” 
because the motion of a “bound” electron becomes infinite. 

To put it another way, there are an infinite number of ways in 
which the potential of a crystal as produced by all the pseudoatoms 
in the crystal may be decomposed into single-site contributions. 
Naturally, we can divide this potential into potentials with bound 
states. But perhaps we can build it up from identical single-site 
potentials none of which have bound states? Needless to say, the 
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convergence of the series may not depend on our choice of the way 
the potential is divided. It is important to realize that the pertur- 
bation theory convergence is determined by the total crystal pseudo- 
potential (rather than the single-site ones). 

One feature of screening is worth noticing. Depending on the 
“strength” (i.e., on the well depth) of the initial mode! pseudopo- 
tential, the screened pseudopotential acquires a repulsive region 
either inside the well (a finite region) or outside it (an infinite region). 
By the Bargmann criterion (2.111), it is only the attraction region 
which should be considered and we observe that depending on the 
value of A the pseudoatom may or may not produce a bound stale. 
As one might expect, the convergence of the perturbation series is 
determined by the well depth. On the other hand, the convergence 
criterion (2.118) requires that both the repulsive and attractive 
regions of the pseudoatom be considered and this may lead to entirely 
different results. Using the simple analytical form (3.45) we can 
easily evaluate the first moment of the pseudoatom (Sec. 2.4.1) 
and then, by letting it go to zero and applying Pendry’s criterion 
(2.135), we can obtain a new optimized pseudopotential. 


3.2.8. Screening in alloys. The dielectric formalism is particularly 
convenient when studying disordered systems. ln this case, all the 
information about the arrangement of atoms is contained in the 
structure factor S (q). Each ion is screened independently of its 
neighbors, i.e. all single-site potentials are identical. 

In reality, a screened single-site potential must depend on its 
position in a disordered crystal, but the linear dielectric formalism 
ignores this dependence. 

A similar situation occurs when pseudopotential theory is applied 
to alloys. A crystal total pseudopotential may be written 


caW a" (a) + enW 3" (q) 
pall ( 


W“ (q)=8 (q) (3.54) 


q) 


where Wi" and W" are the ion pseudopotentials of components 
A and B, respectively, and cy, Cg are their concentrations. The 
function ê! is determined from the equation Zê! =c,Z, + 
Cp pz. 

“Strictly speaking, the pseudopotential (3.94) does not allow sepa- 
ration of the contributions of each component, however conveniently 
its terms may seem to be grouped. This form of screened pseudo- 
potential results from the earlier assumption of linearity of screen- 
ing. To see this, consider the screening pseudopotential (3.20), 


ysr = ABW, where B is a “perturbation theory” operator. Had 
we used higher-order perturbation theory, V°*T would not be linear 
in the pseudopotentials of the components and, consequently, it 
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would be impossible to separate out the screening for each com- 
ponent. 
In the linear case this separation can be done and we write 


WS, s (q) = W's" p (q)/e!! (q). (3.55) 


This means that if dielectric screening theory is applied to alloys, 
the pseudopotential of each component is screened regardless of 
whether or not the crystal is capable of supplying enough electrons 
for the screening. 

We saw in Sec. 3.1 that, provided it is energetically favorable, 
an isolated atom may accept more electrons than the charge of the 
nucleus would seem to permit. An analogous phenomenon may 
occur in dielectric screening theory when applied to alloys. It may 
happen that in a crystal containing an impurity B (whose valency 
Zp is greater than the host valency Z,) the average electron density 
of the crystal will be higher than it would have been without B, 
and the lower-valency component will thus be charged negatively. 
But, as far as metals are concerned, we know that lower valencies 
correspond to the alkali and alkali-earth atoms and we must con- 
clude that these behave like acceptors of electrons, in contradiction 
to elementary chemistry which regards them as electron donors. 

In single-element crystals, such phenomena do not cccur because 
the average charge per unit cell corresponds to the ion valency. In 
pure metals, as we shall see in Sec. 3.5, the total number of accepted 
electrons is indeed equal to the valency. 

It should be expected that screening will be described most accu- 
rately in alloys made up of components with equal valencies. The 
difference between the chemistries of the atoms will be brought out 
by the formula (3.32): the deeper the initial pseudopotential, the 
greater must be the screening density, in approximate correspondence 
to the charge distribution in the alloy. Why approximate? Because 
the second component will also pull electrons to itself in competition 
with the first. This will result in the interatomic density being distrib- 
uted in some “compromise” way (an analog of the covalent bond), 
whereas in the dielectric formalism this region will necessarily be 
depleted of electrons. It is interesting to note that in calculations 
of short-range-order parameters (Sec. 7.3) [194] itis for components 
with close valencies that the best results are obtained. 


3.3. The self-consistency of pseudopotential 
and additive screening 
3.3.1. The problem of self-consistent potential. In the previous 


section, the theory of self-consistent (dielectric) potential screening 
was discussed. The theory is not self-consistent however. To prove 
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this we return to the dielectric screening problem for a “single-site” 
potential. 

Formula (3.25) should be considered as related to the total crystal 
potential 7° made up of single-site contributions. Using the factor- 
ization (1.21) it can be shown that (3.25) holds equally well for each 
individual single-site potential. This means that the formfactors 
of the individual ionic potentials are screened independently. But 
so too are then the potentials themselves because Fourier transforma- 
tion (to real space) is linear. The only effect neighboring ions have 
on a given ion is that all the ions contribute their Z electrons to 
a homogeneous electron gas. Each ion then seizes from this “store” 
however many electrons it needs, irrespective of what is happening 
to a neighbor. 

The total crystal density will, “of course, be a sum of the single- 
site densities since the Friedel “tails” of the individual pseudoatoms 
overlap: 


oc! (r) = 2 o(r — ty). (3.56) 


Thus there is an additional (earlier disregarded) density and this 
enhances the screening of the ion. 

Qualitatively, the same follows from the addition formula for 
the potentials: the “tail” of each pseudoatomic potential will reach 
inner parts of neighboring pseudoatoms thereby changing their 
potentials, in contradiction to the dielectric model which assumes 
the potential is self-consistent. But is there really a contradiction? 

It was stated at the end of the previous section that there are an 
infinite number of ways in which the total crystal potential can be 
decomposed into single-site contributions. It is clear that, in prin- 
ciple, this potential may be represented as a sum of nonoverlapping 
potentials; the summation will not change the single-site potentials 
(or pseudoatoms). 

Are the potentials of these pseudoatoms self-consistent? On the 
one hand, such a pseudoatom remains unchanged when placed in 
a crystal, so that the answer would seem to be “yes”. But on the 
other hand, each ion is screened by a homogeneous gas, and so the 
answer seems to be “no”, for having introduced nonoverlapping 
potentials we have placed, asit were, artificial barriers at the bound- 
aries of the pseudoatoms and made the interpenetration of the 
electron densities impossible. In a homogeneous gas, we would not 
think of boundaries around an atom and the density of such a pseudo- 
atom should therefore be “blurred”; it is too “compact” an entity as 
far as the boundary conditions for such a gas are concerned. 

It must always be borne in mind therefore that self-consistency 
implies subjection to some boundary conditions. A potential may 
be self-consistent for one boundary condition and may be not for 
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another. However evident, this point is frequently overlooked when 
discussing self-consistent potentials. 

We thus conclude that a potential screened using the dielectric 
formalism is self-consistent for an homogeneous electron gas and 
is not so for a crystal. This, of course, has nothing to do with the 
dielectric screening formalism itself but rather follows from the 
assumed linearity in the perturbation potential (3.31). 


3.3.2. Additive screening. The problem of self-consistent screening 
may be approached by a different method. 

In the dielectric method, the self-consistency of screening is required 
from the start (the screening density is expressed in terms of the 
final result) and then various approximations (or models) are invoked. 
We can reverse the procedure by first simplifying our equations (i.e., 
choosing a model) and next requiring self-consistency. This approach 
is advantageous in that it is more sensible (and easier) to improve on 
a well understood (chosen) model than to destroy the exact picture 
by using approximations that are sometimes hardly interpretable. 

We shall call this procedure the “additive screening” method since 
in Eq. (3.17) for the screened potential W°, the initial and screening 
potentials are additive. 

What is the difference then between additive and dielectric screen- 
ing? In both theories the crystal potential W°" is written as a sum 
of the initial (Virit) and screening (Vs) potentials. By (3.18). 
Vser is then expressed in terms of the screening density ps. In 
both cases we demand that the result be self-consistent, i.e., pS 


must be expressed in terms of W% using (3.19). Introducing C = 


AB, we obtain an integral equation for We that is valid for both 
approaches: 


Wo = init 4 ser (3.57) 
we = yt y owe, (3.58) 


In the dielectric formalism, we solve (3.58) exactly by writing 
the solution in the form 


WY -=(4—C)1y i, 3.59 
) 


We then simplify the operator (1 — C)? by assuming it to be dia- 
gonal, considering some terms as a small perturbation, etc. In the 
first approximation, solution (3.59), thus far “exact”, may be written 


aS a power-series in C, 
init , Ayinit , 
wer, — ym + cvi™ , (3.60) 


where Vinit is the zeroth approximation for |V°. 


102 Part 1. Pseudopoiential methods. Theoretical principles 


In the additive screening theory, the integral equation (3.58) 
for the (exact) self-consistent potential is solved iteratively, 


ryt CWE n, (3.64) 


where Wet, is the crystal potential at the nth iteration. The crystal 
potential before the first iteration (at the zeroth stage) is Virit, In 
the first approximation, then, the additive screening theory gives 
We = Vt ey", (3.62) 
which is the first approximation given by the dielectric screening 
scheme, Eq. (3.60). (We shall encounter this situation later, when 
comparing the dielectric and additive screening mechanisms as 
applied to an “electrically neutral” pseudoatom.) 
At higher approximations, the formulas yielded by the two theo- 
ries differ. With additive screening, the Nth iteration gives the Nth 
order perturbation theory term for the nonscreened potential (cf. 


(2.413) and (2.147)): 
we, =v 4. Cvinit y Cpinitoyinit p (EVY, (3.63) 
For the Nth term in the expansion (3.59) the result is different: 


Wein _. yinit + Cyinit + Ccyint +. 4 (CY Nyirit (3.64) 
The difference between these expressions arises presumably from 
the fact that the C operator is nonlinear and accordingly depends on 


the potential it acts upon. Therefore in (3.64) the operator C, in 
view of (3.58), is determined by the final self-consistent crystal 
potential Wer, whereas in each ith term of (3.63) it is determined 
by the crystal potential resulting from the ith iteration. 

Whatever the case may be, it is clear that calculating a self-con- 
sistent potential is equivalent to summing a perturbation theory 
series either like (3.63) or like (3.64). Hence the purely academic 
problem of perturbation theory convergence is closely related to 
the very practical question of the convergence of iterative procedures 
involved in constructing self-consistent crystal potentials. 

Note also that the extension of dielectric screening theory beyond 
the linear approximation, or “nonlinear screening requires includ- 
ing higher terms in the Wer- dependence of pT, i.e., the nonlinear- 


ity of the operators B and C. On the other hand, the question of 
whether self-consistent linear dielectric screening is equivalent to 
its nonlinear version has not yet been investigated. We can only be 
certain that self-consistency lies beyond linear screening. 

A comparison of (3.63) and (3.64) indicates that in additive screen- 
ing the convergence may be either faster or slower than with dielec- 
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tric screening. The additive scheme is however more convenient 
because the real-space treatment makes functional dependences 
much more transparent. It is therefore much easier to choose screen- 
ing models closer to the self-consistent result*. 


3.3.3. Self-consistency and electroneutrality. The next question to 
be addressed is the possible forms such self-consistent potentials 
may take. 

The number of electrons within a Wigner-Seitz cell must be equal 
to the charge Z of its ion. (All Wigner-Seitz cells are identical and 
were one of them charged, so would be all the others, which, by 
assumption, is not the case.) The electroneutrality of a Wigner-Seitz 
cell is thus a consequence of the translational invariance of a neutral 
crystal. 

Further, since electroneutrality secures the complete compensation 
of charge, the region outside a Wigner-Seitz cell must have neither 
an attractive nor a repulsive potential. Consequently, a single-site 
crystal pseudoatom obeying the “crystalline” conditions for potential 
self-consistency, should be “of finite range”: 


wrt (r)ii>iR ysl = 9; (3.65) 


where Rws is a vector at the boundary of the Wigner-Seitz cell. 
This formula excludes Friedel oscillations, which is wrong for a 
single scatterer placed in an electron gas, but is correct for an infinite 
set of scatterers (a crystal) where all the Friedel “tails” are identical 
and must be summed up. 

When single-site potentials obeying (3.65) are combined to form 
a crystal potential (which is the sum of them), the potentials of 
neighboring sites no longer affect cach other. A potential that obeys 
(3.65) is thus more “self-consistent” than one that disobeys it. Expres- 
sion (3.65) may therefore be considered as an optimization criterion 
for the potential or, more generally, pseudopotential. 

In practice, the surface of a Wigner-Seitz cell is too complicated 
a geometry for this criterion to be reasonably applied. To get around 
this obstacle it is customary to replace the polyhedral Wigner-Seitz 
cell by a Wigner-Seitz sphere of the same volume (which is the atomic 
volume 2, or the volume per atom). Denoting the radius of the sphere 
as Ra. we rewrite (3.65) in a spherically-symmetric form: 


W“ (r) lion, = 0. (3.66) 


* In band theory, the construction of an additively screened self-consistent 
potential takes a great deal of computational effort. First, the secular equation 
must be solved for E (k) . Then the electronic wave functions are found. These 
are used to calculate the crystal potential (the input potential for the next it- 


eration), and the procedure is repeated. The construction of V'"'* (“starting” po- 
tential) is described in Sec. 3.4. 


104 Part 1. Pseudopotential methods. Theoretical principles 


It should be remarked that spherically-symmetric pseudoatoms 
of radius Ra will overlap. To avoid this, Ra should be set equal to 
the MT-radius, which is half the nearest-neighbor distance (the MT 
potential was introduced in Sec. 2.2). But then all the charge Z 
will concentrate in a volume smaller than the Wigner-Seitz sphere, 
which is a priori a more restrictive condition than (3.65). Indeed, 
the MT-sphere is the one inscribed in the Wigner-Seitz cell and 
there is no reason to assume that all of the charge is pulled into the 
sphere. Perhaps R, should be considered as a model radius, but this 
possibility has not yet been investigated. We thus assume the model 
radius of (3.66) equals the atomic radius R, defined as the radius 
of the atomic sphere of volume Q, (the Wigner-Seitz sphere). 

Note that (3.65) in fact contains two conditions. The first (and 
stronger) one may be applied as a condition for screening: it requires 
that the potential be of finite range (incidentally, this requirement 
automatically removes the divergence in the long-wavelength limit 
of the formfactor). 

The second condition, the electroneutrality of the Wigner-Seitz 
cell, is complementary to the first. 


3.3.4. Comparison of screening models. It would be desirable to 
compare dielectric and additive approaches to the potential screen- 
ing problem. We restrict ourselves to potentials satisfying the opti- 
mization criterion (3.66). This will make it easier to estimate the 
reliability of our conclusions. 

Starting from dielectric screening, how can we construct a pseudo- 
atom that obeys criterion (3.66)? The only way to do this is to go 
from the formfactor of the screened potential (3.25) to its prototype 
and then to fit the potential parameters so that condition (3.36) 
is fulfilled. Clearly, the Friedel oscillations make this infeasible 
because within the dielectric screening approach the “tails” of pseudo- 
atoms will necessarily interpenetrate. 

Help comes from the problem we solved back in Sec. 3.2.5 while 
discussing screening in the Thomas-Fermi approximation. In this 
approximation there are no Friedel oscillations, so that a disadvan- 
tage from the dielectric screening point of view (for a potential embed- 
ded into an electron gas) becomes an advantage for a potential within 
a “community of its likes”. . 

Thus, by choosing Ra = (8Q,/4n)/3 and using the results of 
Sec. 3.2.5 for Thomas-Fermi screening, we obtain from Eq. (3.49) 
a condition for determining the well depth for a Heine-Abarenkov 
type pseudopotential. From the dielectric formalism point of view, 
this potential will be an optimized one. 

The next step is to set up an optimized pseudopotential within 
the additive screening approach. We reason that to retain consistency 
with the dielectric formalism, we may assume the screening electron 


Ch. 3. Theory of potential 105 


gas to be initially distributed over all space. This means that each 
Wigner-Seitz cell must contain exactly Z electrons. In coordinate 
space, their distribution density is Z/Q,. We may calculate the cor- 
responding Coulomb potential and then to add it to the original 
pseudopotential (3.44), the same we used in the dielectric screening. 
In this case we obtain an additively screened pseudopotential. 

By electrostatics we then easily obtain for the screening (repulsive) 
potential 








a, T(E) |. SRo 
Ze? 


r 


VS" (r)= (3.67) 





, r> Ra 


It can easily be seen that while at small r the behavior of the 
crystal pseudopotential is the same for both screening models 
(a — br), at large r the agreement breaks down. This might have 
been expected because the mathematical formalism we are using is 
different. 


To compare the pscudopotentials numerically, the well depth A 
of an additively screened pseudopotential should be chosen so that 
the long-wavelength limit of its formfactor be the same as with the 
dielectric screening. The two pseudopotentials will then have the 
same mean value and ranges but will differ in the well depths; it is 
this difference that will allow us to distinguish between the two 
screening models. 

Let us then evaluate using (3.67) the formfactor of the potential 
screened 


W (g) = W° (g) + (a) 


— cosqRa of J_ 24s 7] Ji (Ra) 2 BQ), 
= —3As Ps JE arn | a (8.68) 


At the limit q > 0 we obtain a relationship between the well depth 
and the long-wavelength limit of the formfactor: 


~ 








W“ (0) =1.24,— A. (3.69) 
Setting Wer(0) = —(2/3)E 7, we find the required relationship: 
A= 1.2Ag+—— E$. (3.70) 


As it must, this (optimized) well depth differs substantially from 
that given by dielectric screening theory, Eq. (8.49). This makes. 
the results of calculations by (3.70)* all the more significant. The 


__ o 
* Lack of space prevents us from quoting the numbers. If needed, they can be 
easily reproduced by the reader. 
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parameter A is invariably larger than A, but the difference is at 
most 3% and the agreement between the two theories is better for 
transition than for nontransition metals. 

There are three conclusions to be drawn from this. First, the sim- 
plification of the dielectric operator makes it hardly sensible to 
speak of self-consistent dielectric screening. For in the additive 
formalism, assuming a uniform screening density and thereby neg- 
lecting the effect of the crystal potential, we obtained the same result 
as given by the dielectric formalism which does incorporate this 
effect in Eq. (3.32). 

Second, the simplest additive screening method, while not inferior in 
accuracy, may prove more versatile than the dielectric, since it is 
an easy matter to replace the uniform screening density by a nonuni- 
form one (provided the integral for the screening potential is cal- 
culable). 

Finally, the agreement between the two screening methods as 
applied to a potential optimized with respect tothe electroneutrality 
of the Wigner-Seitz cell suggests that this criterion may be regarded 
as internally consistent and potentially useful. 

In addition to incorporating the nonuniformity of the electron 
gas, the additive screening method seems to have another major 
advantage over the dielectric screening method. Using the X, 
method or a similar technique we can explicitly introduce the ex- 
change interaction, including that between the core and valence 
electrons. 

The reader should be reminded that in the dielectric screening 
method, the exchange interaction is also taken into account but in 
a different way. The exchange interaction between the core electrons 
is included as part of the definition of the ion’s model parameters, 
the interaction between the valence electrons is incorporated in 
e* (q), while the core-valence exchange cannot be considered in a 
consistent way because the corresponding expression may not be 
included in the interaction potential between free valence electrons 
that determine e* (q). 

Although the additive screening method is frequently combined 
with the MT approximation in band calculations based on secular 
equations (the method has then some similarity to the LCAO method), 
its philosophy goes beyond this approximation. In the following 
section we shall discuss the way additively screened potentials are 
derived for the widely used augmented plane wave (APW) and Kor- 
ringa-Kohn-Rostoker (KKR) band calculation methods. A com- 
parison with the dielectric screening will also be made. 

In pseudopotential theory this method, whatever its advantages, 
has not been widely used, although one feature of it has been applied 
in a number of papers. This is the possibility of self-consistent (iter- 
ative) calculations in which each iteration gives, by (3.32), a screen- 


Ch. 8. Theory of potential 107 


ing density which is then transformed into coordinate space. Using 
Poisson’s formula we determine from this density the screening 
potential and using the X, approximation we find the exchange 
potential. Having added these two contributions to the initial ion 
pseudopotential, we evaluate the formfactor of the screened pseudo- 
potential, and the iteration process is repeated to -self-consistency. 

Calculations of this kind have been performed for model pseudo- 
potentials on semiconductors [195-200], transition metals [2041-207], 
semiconductor surfaces {200, 208-219], and transition-metal surfaces 


(205, 220]. 


3.4. Muffintin potential 


3.4.4. MT approximation. This approximation was introduced in 
1937 by Slater [221] in a paper on the augmented plane wave (APW) 
method. His idea was to make the functions involved in the scatter- 
ing theory more tractable. 

The essence of the approximation is the introduction of a sphere 
(MT or Slater’s) such that inside it the potential is spherically sym- 
metric and outside zero. It is implicit that MT spheres do not over- 
lap, that is, the MT radius is no greater than half the nearest-neighbor 
distance, dan- 

In this section the MT approximation is to a certain degree justi- 
fied. We compare it with the dielectric screening formalism and dis- 
cuss their relative strengths and shortcomings. 

Thus, in terms of Sec. 3.3, the principal idea of the MT approxima- 
tion is the use of nonoverlapping single-site potentials. 

Suppose the crystal is monatomic. We assume the total initial 
potential is the sum of the atomic (or ionic) potentials, as in Eq. 
(3.56) for the electron density: 


7 ON (r= DT (rt). (3.74) 
Vv 

Then in the directions to the first nearest neighbors, the quantity 
J 'init(d,,/2) will be smaller (the potential lower) than in the di- 
rections to the second nearest neighbors. The “anisotropy” in the 
nearest-neighbor distances thus gives rise to the anisotropy in crystal 
potential. 

But, as we saw in Sec. 3.3, any potential, when placed in a crystal, 
causes a redistribution of electrons in it. Clearly, so will potential 
(3.71). In this case the electrons will “flow” into the regions of lower 
potential, i.e., in the directions of lower values of Yit, To put 
it another way, the electron distribution will become anisotropic. 

But this anisotropy will lead to an anisotropy in the screening 
potential: the regions of minimum 7 "it will have maximum val- 
ues of 7 r, The screening (or rather the self-consistency of screening) 
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will thus result in smoothing off the anisotropy of the crystalline 
potential. Note that this effect must be more pronounced for ele- 
ments whose electrons are less tightly bound to their atoms, that is, 
for simple metals. 

The space distribution of self-consistent screened potential 7% © 
may be thought of as consisting of nearly flat regions between the 
atoms (where the extrema of #"™t and Z's? cancel cach other) and 
nearly spherical regions inside atoms, where the screening cannot 
compete with the atomic potential and F “ resembles the free-atom 
potential. Consequently, the self-consistent 7° may indeed be 
subdivided into a sum of single-site nonoverlapping contributions 
from Wigner-Seitz cell potentials VWS; 


7 (ry => VS t), (3.72) 
Vv 
where 
VYS (rl), O< |r] << Ryn 
VWS (7) = V, Ryr <r < Rws, (3.73) 
0, Rws< r< œ. 


Here Ryp must be less than or (as it is customarily chosen) equal 
to half the nearest-neighbor distance, dnn/2. Rws is a vector on the 
boundary of the polyhedral Wigner-Seitz cell. The shape of VWS 





Fig. 3.4. Potential of an isolated Wigner-Seitz cell (in two dimensions). 


(3.73) is shown in Fig. 3.4. The MT plateau and the spherically sym- 
metrical (or, in this case, cylindrically symmetrical) potential can 
be seen in the centre of the cell, both depressed by Vy relative to 
the vacuum zero. Above the plane the motion is infinite. The energy 
bands of the crystal lie below the vacuum zero, as illustrated in 
Fig. 2.5. 

That V (r) outside a Wigner-Seitz cell is zero follows from the 
requirement that these potentials do not overlap. On forming a 
crystal, the Wigner-Seitz polyhedra will tessclate and there will 
be no potential barricrs between neighboring cells for electrons whose - 
energies are between the vacuum and the MT zeroes. As a result, 
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the electrons are permitted to move unhindered throughout the 
crystal like “free” electrons. In this model, it is only the MT poten- 
tials which are significant. 

Consider now the Schrodinger equation with potential (3.73): 


(—V?4+V"* (r) —£) Y (r)=0, (3.74) 


where & is measured from the vacuum zero. The energy levels of 
the crystal lie at E < 0. We introduce a new variable 


Eur=E— V.. (3.75) 


This quantity is measured from the MT plateau and is positive for 
energies above it. Note that V < 0. Equation (3.74) may be re- 
written as 


(— V24- VS (r)— V, — Eyr) Y (r)=0 


It can be seen that, by shifting the energy zero, we can introduce 
a new MT potential Vmr defined as 


Vurr (r) = VS (r) — Vo. 


When an infinite crystal is being formed, we saw that the specific 
value of the potential outside the cell is immaterial, which enables 
us to write 


Vur(it}), 0< |r| < Rur, 


Vur()= | 0, Rur < |r| < œ. (3-76) 
A shift by V, could be applied to a crystal-potential made up of 
VWS potentials. The energy zero would then lie on the MT plateau 
and (3.76) for the MT potential would become self-evident. 
Equation (3.76) for a single-site potential is closely related to the 
condition for the potential optimization based on electroneutrality 
(Eq. 3.65). Equation (3.73) corresponds to this condition exactly. An 
MT potential thus reflects the electroneutrality of the Wigner-Seitz 
cell and, furthermore, incorporates self-consistency in that a self- 
consistent potential is less anisotropic than a nonself-consistent one. 
An MT potential is thus not as far-fetched as might at first seem. 


3.4.2. Construction of an MT potential. With a model potential 
chosen, we now turn to the potential construction proper (for additive 
screening, for dielectric, the MT potential was discussed in Secs. 3.2 
and 3.3). . 

Since, the MT potential is associated with screening by electrons, 
we obviously must add together the electron density “tails” of all 
of the initial atomic potentials (it will be assumed that even a non- 
self-consistent calculation yields the exact value of p°" rather than 
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a mere sum of atomic densities): 
o% (r) = 2) p% (r— ty). (3.77) 
Vv 


The above expression has the same anisotropy as Eq. (3.71). 
Let us average (3.77) over all directions of the vector r. This will be 
equivalent to resettling electrons from high-density to low-density 
regions, or to creating a more uniform electron density which, as we 
saw earlier, is reminiscent of self-consistency. We obtain 

o (Jr) =- 3 \ pt (r—t,) sin © dO dg. (3.78) 


An 
Ny 

The integral here can only be taken numerically. The usual tech- 
nique for doing this is to re-expand a function centered on one site 
in terms of functions centered 
on another. This scheme is 
due to Löwdin [12] and is 
usually referred to as Löwdin’s 
a-expansion. Since Mattheiss 
[222] was the first to apply 
this scheme to crystals, the 
following procedure of con- 
structing a crystal MT poten- 
tial is often called the Mat- 
theiss method (for detailed 
reviews, see [223, 224]). 
Fig. 3.5. Vector rformsan angle O with Since the final results may 


the vector from the atom at point Oto ee 
that at point A. As O is varied from 0 toa, seem somewhat doubtful, it is 


the projection of AQ (or the segment AQ’) Worthwhile to outline here 
changes from AN to AP. the main features of Loéwdin’s 


a-technique. 

Consider the density re-expansion from the vth to the zeroth site. 
Let the z direction be chosen along the axis of the two sites, as shown 
in Fig. 3.5. Since the problem is cylindrically symmetrical, the 
integration over @ will simply yield 2a. In integrating over © (from 
O to 2s) the z-projection of the vector AQ = r — t, will change from 





AN =|t,|—|r]to AP =|t,|]+ |r|. Since cos 0 = z/r and 
sin O dO = —d cos O = —dz/r, a O-averaging relative to the zeroth 
center is equivalent to z-averaging from |t, |— |r| to |t,|—+ 
|r |. After the necessary substitutions we obtain 
tytr 
p(n) =e" (r+ 5- = | erdridry (3.79) 
V=e=0 y tyor 


The first term here corresponds to the initial density of the central 
atom and the second term to the screening density. 
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Given the crystal density p“, the Coulomb potential is determined 
from the Poisson equation. The solution may conveniently be writ- 
ten as 





r Ra 
youl (r)= EEGEN ane’ \ (ry) ri dry + 4me* \ po (rı) ri dra, (3.80) 
0 r 


the sum of integrals resulting from the multipole expansion of 
(r — r) involved in the integral of the type (3.4). 
The exchange potential is evaluated by Slater’s formula (3.12): 


Ver (r) = — > a (372p (r). (3.81) 


Some workers [129-131] multiply V°** by the function F of Eq. (3.8), 
thereby incorporating correlation effects [225-229] and taking into. 
account the dependence of the exchange potential on the electron 
state it acts upon. Other attempts to introduce dependence on the 
state of the electron may be found in [230-235]. 

The shape of the potential at r < Ryr is thus known. Next we 
should determine the quantity V5. 

From the way it was introduced, V, is the potential averaged over 
the region between the MT sphere (where r < Ry) and the bounda- 
ries of the Wigner-Seitz cell. This is exactly the way it is usually 
evaluated [223, 224, 345, 346]. 

Another option is to apply the criterion for the electroneutrality 
of the Wigner-Seitz cell by averaging the density over the volume- 
[236]. The charge inside the MT sphere is simply 


RMT 
Qua == 40 0° (r) r? dr. (3.82): 


0 


The charge outside the MT sphere is then given by the difference. 
between Z, which is the total number of electrons, and Qy,7, so 
that the charge density outside the MT sphere is 


p = (Z — Qur)/(Qo—Qut). (3.83): 


With this approach, the MT approximation is applied to the 
density rather than to the potential. The important point is that 
in contrast to the additive screening scheme, which uses a homo- 
geneous screening charge (Sec. 3.3), the density in question is the 
total one rather than the screening density alone. A spherically 
symmetrical crystal density automatically brings about a spherically 
symmetrical crystal potential in accordance with the self-consistent. 
potential being nearer to the spherical symmetry than the initial 
one. 
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The MT approximation for the density leads automatically to 
the MT form of the exchange potential; the MT shift due to the ex- 
change is readily evaluated by using (3.81) and (3.83), under the 
assumption that (3.81) holds for all r. The expression for the Coulomb 
MT shift (due to the Coulomb potential) is rather cumbersome but, 
in principle, straightforward [236-238]. 

Other methods for evaluating V, are discussed in [222, 223, 239- 
243]. 

As the foregoing argument suggests, a specific value of V, may be 
found from some model assumptions about the MT potential, whether 
or not these are optimization criteria of the type (3.55). Vo should 
therefore be considered an optimization parameter (see Sec 3.4.4). 
This means that the conduction band bottom need not coincide with 
the MT zero. Even in the NFE model (Chapter 1) we saw that the 
use of perturbation theory resulted in lowering the bottom of the 
nearly-free-electron band relative to the energy zero (or, in that 
model, the vacuum zero). The conduction band may be either wholly 
above the MT plateau or partially below, in which latter case Eq. 
(3.75) will yield negative energies. 


3.4.3. Limitations of the MT model. The strengths of the MT 
model are (as often happens) its weaknesses. The finite-range nature 
of the MT potential results in a potential jump at the boundary of 
the MT sphere, which means by the Poisson theorem that there is 
a charge on this boundary [244] with a surface density 

1 dVur 
0 = Za dr =R . (3.84) 

Fhe existence of a surface charge follows very clearly from the 
MT approximation for density which has a jump at the MT boundary. 
It is this jump which forms the surface charge. The effect cannot be 
given any physical significance and is simply an artifact of this 
potential constructing model. 

It may be said that we have subdivided the screening density 
into two contributions, one of which is “smeared” within the MT 
sphere (no uniformity is assumed for the density) and the other is 
on the MT boundary. Perhaps we would be better off if the latter 
were also smeared throughout the sphere—if in no better way, 
then uniformly. Although computationally this would not be dif- 
ficult to carry through, such a model has not been tried. 

Although the subject of our discussion has been the modeling 
of a self-consistent potential with an appropriate MT form, we note 
that there is actually no self-consistency, and this not only from a 
numerical point of view (here the lack of.self-consistency is obvious 
if no special. iterative procedures are employed). The point is that 
the MT model is intrinsically incousistent. (Recall that it is gener- 
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ally used in APW and KKR band calculations.) The reason is that 
the consistency between the MT potential and the MT density is 
impossible in principle until the latter is zero at r œ> Ryy . This 
can easily be seen from Poisson’s equation, according to which the 
screening density is proportional to the Laplacian of the screening 
potential. If the potential is constant, the density equals zero 
rather than the p™ of (3.83). If the screening density is a (non-zero) 
constant, the screening potential is different from a constant and by 
the Poisson equation (cf. (3.67)) we obtain 


t 
v = So [— 2R + 3r R — r]. (3.85) 





It follows that even if p (r > Rymr ) is independent of r, Van: does 
depend on the coordinate, i.e., because of the MT approach to the 
density, the single-site potential Vur must also be coordinate- 
dependent in the region between the MT sphere and the boundary 
of the Wigner-Seitz cell. But this is inconsistent with the MT form 
(3.76) usually employed in calculations. (The reader will be reminded 
that the “self-consistency requirement” underlying the MT approx- 
imation implies a O-independent, and not an r-independent, po- 
tential.) 


3.4.4. Atomic sphere approximation. We have ascertained that the 
presence of the MT plateau makes a potential curve a poor approxi- 
mation. But is it not possible to do away with the plateau while 
retaining the finite-range nature of the single-site potential? Clearly 
we are again obliged (as we were in pseudopotential theory) to 
optimize the crystal potential. What then may the optimization 
parameter for the MT model be? 

To answer this question, consider the potential of an isolated 
Wigner-Seitz cell, (3.73). We see that Vo, the MT shift whose mag- 
nitude is to be chosen from some considerations, is just the parameter 
we need. The quantity V, is determined from the requirement that 
the electroneutrality of a cell, by analogy with the electroneutrality 
of the atom, is preserved. 

The potential shown in Fig. 3.6 by the bold line is for a pseudoatom 
of the MT model. The MT plateau in the pseudoatom replaces the 
true behavior of the potential at Ra > r œ> Ryr. If we reconstruct 
this curve, then at r = R, the potential will have a jump as U, = 
V (Ra) Vo- The physical meaning of the jump is transparent. 
In the absence of the jump (U, = 0) there would be too few electrons 
in the cell because the potential VWS — U, would not be deep 
enough. The pseudoatomic electronic levels would thus be positive, 
and the electrons therefore free to move throughout space and leave 
the cell. To “retrieve” them, we have to lower the potential (in this 
case, as a Single rigid whole) thereby localizing electrons inside. To 
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prevent this new charge density from destroying the “old” potential, 
these electrons must be distributed so as to make the resulting screen- 
ing potential the same square well we added to lower the initial 
potential. Clearly, the only distribution to satisfy this requirement 
is one that yields a surface charge, q.e.d. 

Now, however, the single-site potential is no longer isotropic 
because U, (Ra) takes on different values in different directions. The 


MO] 





Fig. 3.6. The potential of an MT pseudoatom. In the atomic-sphere model (Secs. 
3.4.4 and 4.5.3) the potential extends to R, (dashed line). 


departure of the Wigner-Seitz cell from a sphere should therefore 
be neglected. Replacing the true cell with a Wigner-Seitz sphere is 
known as the atomic sphere (AS) approximation: 


V (Irl), r< Ra 


Vas (r) = | 0, r> R.. (3.86) 

Although this approximation was suggested by Anderson [245, 
246] from a somewhat different position, his parameters have very 
nearly the same meaning as ours. 

The crystal potential is the sum of atomic sphere potentials 
(3.86). In this case the single-site potentials overlap, which is unde- 
sirable. However, in dielectric screening theory we did not bother 
about this overlap either for ordinary pseudoatoms, where it was 
very much stronger than in this case or in Thomas-Fermi MT pseudo- 
atoms, where it was of the same magnitude. 

The AS model not only preserves the electroneutrality of the 
Wiener-Seitz cell (sphere) (which means a fulfilment of the optimiza- 
tion criterion (3.65)), it also ensures consistency between the density 
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and potential (which means intrinsic self-consistency of the model). 
It should be neted that the AS model is different from the sometimes 
used model of overlapping MT spheres, in this the MT plateau is 
also present but the overlap results from large MT radii. 

ln the AS approximation the quantity U, plays the role of a new 
“MT shift” and may be chosen more or less arbitrarily. In Sec. 5.2 
we shall analyze the extent of this arbitrariness and discuss some 
aspects of the AS approximation as used in band theory. 

The AS model is an attempt to strengthen the MT approximation 
at its weakest point, the plateau. There have been other attempts 
because a model of spherically symmetrical potential is too con- 
venient to be easily given up. 


3.4.5. Improvement of the MT approximation. The first attempts 
were made long ago [247, 248]; any improvement involves some 
amendments both inside the MT sphere and outside it. In the latter 
case, the MT plateau is no longer a true flat plateau but rather 
a warped surface. hence the term “warped MT” [249]. Since it is the 
plateau where the MT model is most vulnerable, the introduction 
of potentials with warped plateaux is often the only measure used 
for improving the model. (Such potentials are called warped MT 
potentials [247. 250] or more generally, non-MT (NMT) potentials.) 

But is the removal of the plateau sufficient? We saw in Sec. 3.4 
that the d electrons are most affected by the inner (rather than outer) 
region of potential. This means that in the transition metals the 
band structure must be very sensitive to the actual potential inside 
the MT sphere and a warped MT plateau may therefore prove an 
insufficient measure. 

A large number of completely NMT calculations have been carried 
out for both nontransition [251-253] and transition metals (Fe in 
[254-256], V and Cr in [257]). 

The transition-metal calculations indicate the importance of 
NMT corrections. On the other hand, a semi-empirical Fermi-surface 
study for Mo and W has shown that the inclusion of deviations from 
the MT form improves the agreement with experiment. 

At present a considerable effort is being spent to eliminate the 
shortcomings of the MT model itself [259-261] (also see [261] for 
references). But even if such an NMT potential is constructed, the 
resulting Schrodinger equation would be difficult to solve. This 
topic is discussed in a number of works. In [262] and [263], for exam- 
ple, Williams and Morgan extend the phase function method to 
include nonspherical potentials. A number of workers [264-270] 
employ the idea [271] of replacing / and m, the orbital quantum 
numbers for a spherical scatterer, by new quantum numbers in 
whose representation the phase shift matrix is again diagonal. 
(Since the orbital momentum for a nonspherical scatterer is no longer 
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an integral of motion, the phase shift matrix cannot be diagonal in 
the l representation.) All these methods, however, are too compli- 
cated to be popular. 

ln spite of all this work. the effect of a departure from the MT 
model is still an open, and sometimes controversial, question. In 
[272], for example, the significance of NMT corrections was investi- 
gated for the typical transition metals Rb (completely empty d band), 
Nb (half-filled) and Pd (completely filled d band). The effect of the 
corrections on band structure was found to be all but negligible, 
with level shifts of only 0.002-0.01 Ry, with larger figures for Nb, 
as might have been expected. Shortly afterwards, however, relativis- 
tic calculations of the Nb band structure and Fermi surface [273] 
proved to be very sensitive to NMT corrections. It should be noted 
that the potential-constructing procedure adopted in [273] is different 
from that in [272]. The greatest NMT-related differences occur at 
the Fermi surface where an NMT potential gives rise to new (previ- 
ously absent) electronic orbitals which should be observable in 
experiment. 

The discrepancy between the two works is perhaps less significant 
than it may seem. In a metal like Nb where the Fermi level lies in 
the middle of a dense d band, small variations in the dispersion laws 
produce (or eliminate) intersections of some of E (k) branches with 
the Fermi level. This will result, as found in {273]*. in the appearance 
(or disappearance) of Fermi-surface “pockets”. 

Errors introduced by the MT approximation are larger in loose 
structures in which even a self-consistent density is not isotropic. 
This may be illustrated by a Fermi-surface calculation for tin [275], 
where the introduction of a warped plateau reduced, by nearly 
threefold, both the rms and maximum errors in reproducing the 
Fermi surface. 

A Fermi surface is thus expected to be very sensitive to corrections 
for the departure of a potential from the MT form. Unfortunately, 
this is difficult to verify (e.g., by the de Haas-van Alphen effect) 
because the shape of the surface is also sensitive to many other 
effects. The electron-phonon interaction, for example, while little 
affecting the dispersion law Æ (k) within the Fermi sphere, greatly 
influences the behavior of the Fermi-surface electrons [276, 277]. 
(Which suggests, conversely, the importance of NMT corrections in 
calculating electron-phonon effects.) 


3.4.6. Effect of œ on band structure. The shape of the Fermi surface 
also depends on other parameters of the potential, for example, on 
the a in the exchange potential (3.81). An increase in @ raises the 


* Self-consistency greatly reduces the difference between MT and NMT re- 
sults [274]. 
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potential in the inner region of the scatterer. In an isolated atom 
this results in lowering s and (more markedly) d levels. Consequently, 
in an isolated MT potential an increase in a will lower the d resonance 
relative to the MT zero while simultaneously sharpening it. This 
means, in turn, that the d-type band will be brought down relative 
to the s band (and sharpened at the same time). The result will not 
only be a modification of the band pattern but of the Fermi surface 
as well much like with NMT correction for nonsphericity. 

The a-dependence of the scattering characteristics of the MT-po- 
tential is a very useful (if somewhat unusual) model for describing 
the behavior of an atom during a crystal formation. It is worthwhile 
therefore to discuss the effect of & in some detail. 

Table 3.2 shows the position of the s-band bottom (£p,), the energy 
of the d resonance (Ega), the width of the d resonance (I) and the 


Table 3.2. Effect of œ on characteristic band levels [278] 








z Fe (3dë 482); Ry Cu (3d104s1): Ry 
% 2/3 1.0 2/8 1.0 
Ey, 0.4182 0.094 0.093 —0.012 
Eq 4.200 0.464 1.469 0.451 
T (Ea) 0.213 0.022 0.269 0.017 
Ea— Er; 4.018 0.367 1.376 0.463 


“height” of the d resonance above the s-band bottom (Ea — E£y,) 
as calculated in [278] for Fe and Cu for different a-values. The MT 
potential was constructed by the Mattheiss procedure from wave 
functions calculated by the X, method with the Latter correction 
and @ = 1 [101]. 

As expected (see Sec. 3.1), the s and d energies are lowered and 
the width of the d resonance is reduced as « is increased. 

Note that from general considerations the d electrons in Cu must 
be more localized than those in Fe and hence more sensitive to changes 
in the inner-region potential (and thereby in a). Table 3.2 shows 
that indeed all the Cu parameters vary more rapidly with œ than 
those of Fe. But since the d resonance in Cu is more a-sensitive than 
that in Fe, a decrease in @ may result in the Cu resonance being 
higher (and hence wider). Paradoxically, the d electrons in Cu may 
prove to be less localized than those in Fe! Table 3.2 shows that 
this is indeed the case at «@ = 2/3 when ES" > E} and [cus 
Tte, that is, the d band in Cu is wider than that in Fe. This is 
of course unacceptable. 
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3.4.7. The quasi-excited atom model. Admittedly some fundamen- 
tal mistake must have been overlooked in our apparently flawless 
argument. 

This mistake may have nothing to do with the variation of œ 
(see Sec. 3.1) and the finger can only be pointed at the assumption 
concerning the orbital configuration. Is it actually justified? Sup- 
pose an atom in the crystal is not in the ground state but in an excited 
one. For iron this means a transfer of an s electron to the d orbital. 


d band ds? 
characteristics 
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Fig. 3.7. The effect of configuration on the dispersion law of the V atom, non- 
self-consistent calculation. The assumed ground-state configurations is d?s?. 


The nucleus will thus be screened better (see Sec. 3.1), the atomic 
levels will rise and the d resonance will be higher in energy than 
it is in the ground-state case. The d-electron wave functions will then 
become more diffuse or, equivalently, the d electrons will somewhat 
delocalize. 

For copper the excitation assumption implies an electron transfer 
from a d to the s orbital. This will lead to the d shell contracting and 
the d resonance lowering and narrowing. 

The dependence of band structure on the assumed configuration 
has been reported. Figure 3.7 shows the band structure of V calculated 
in [240] for two configurations (the ground one is 3d*4s”). The energy 
is measured from the bottom of the s band (point [,) so that visible 
only are the rise in the d band relative to the s band and its broaden- 
ing in accordance with the above argument (the filling of d holes 
raises the d levels in energy). 

The excited atom model thus seems to be able to resolve the para- 
dox of the wrong d-band widths because in Fe it makes the d band 
wider and in Cu narrower. The trouble is that the model appears 
to be even more absurd than the initial dependence of scattering 
characteristics. Nevertheless, it is easily justifiable. 

On placing an atom into a crystal, it will be penetrated by the 
electron densities of neighboring atoms. The screening in the inner 
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parts of it will then become stronger than it was in the free atom 
and, naturally, all the atomic levels will be raised. Since no mecha- 
nism is there to compensate for the rise, the total energy must in- 
crease, which is unfavorable. To remedy this, the atoms should 
rearrange their valence shells which means a transition from the 
ground to any other (or, to use the accepted term, excited) level. 
Thus an atom in a crystal must indeed be in an excited state, the 
excitation coming from the screening potential Vser which is created 
by the eclectron-density “tails” of the neighboring atoms. 

It has been known for some time that the best agreement with 
experiment is shown by dispersion laws obtained by the following 
model [279-281]. In calculating the crystal density by (3.56), the 
wave functions are taken from a free-atom excited configuration 
while the occupancy of the electron shells is assumed to be that of 
the ground-state configuration. (This is reminiscent of the transition 
state discussed in Sec. 3.2.) For example, the single-site density for 
an atom in the ground-state configuration 3d"~-*4s? is 


p (r) = (n— 2) 934 (conf. 3d"~14s') + 2074 (conf. 3d"-!4s!) 
=(n—1)p3i (conf. 3d"-14s1) + of! (conf. 3d”-14s!) 


+ pit (conf. 3d°-14s*) — 935 (conf. 3d"14s!). 
As a result, we have 


p (r) =p%* (conf. 3d"-14st) + [pis —P 3d} tor cont. ag™-tue (3-87) 


The density of an atom in a quasi-excited state is thus that of 
the atom in an excited state plus the addition pêt — poat which 
on the one hand leads to an additional depletion of electrons in the 
inner region (ot is subtracted) and on the other to the enrichment 
of the outer region (02 is added). It will be noticed that this redistri- 
bution is complementary to the one arising in the density of the excit- 
ed state in (3.87) relative to the ground state density. It can be 
easily verified that for the ground configuration 3d"~4s! the effect 
will be somewhat different, namely, the outer region will be de- 
pleted. 

When such quasi-excited atoms are brought together to form 
a crystal, the “enriched” densities of neighboring atoms will pene- 
trate the inner region of a given atom compensating for the (associ- 
ated with the model) deficiency of electrons. If the inner region is 
not depleted in the model, an excess of electrons will be produced. 
We thus conclude that forming a crystal from excited atoms (as 
proposed in [279, 280] without any physical justification) we obtain 
more self-consistency than with ground-state atoms. 

However successful the initial choice for the crystal potential is, 
it is clear that the subsequent self-consistency procedure will alter 
the charge distribution: the charge inside the MT sphere will either 
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increase or decrease depending on whether the effect of neighboring 
densities was underestimated or overestimated. An example to 
illustrate this is provided by our calculations (in cooperation with 
G. M. Zhidomirov, O. T. Malyuchkov and J. S. Shpotin) for small 
transition-metal clusters. We employed the SCF-X,-SW method 
[87, 283] which is a cluster-oriented KKR scheme [284, 26]. The 
initial superposition of densities was calculated in the usual manner, 
that is, the excitation of the atoms was neglected. 

The model of excited atoms predicts that after the first iteration 
the charge of the MT sphere will be markedly larger or smaller (de- 
pending on the metal) than that produced by the superposed ground- 
state densities. In the course of a self-consistency procedure, the 
charge slowly returns to the initial (superpositional) value while 
the cluster’s energy levels move away from their nonself-consistent 
positions. After the self-consistency procedure, the charge of the 
MT sphere is always greater than that given by the density super- 
position. This nonmonotonic behavior of the charge inside the MT 
sphere should be kept in mind when comparing the results for a given 
screening potential model with those for a nonself-consistent cal- 
culation. As our estimates indicate, the superpositional MT-sphere 
charges are closer to the self-consistent values than are the charges 
resulting from the first iteration*. It should be emphasized that the 
charge we are talking of is an integral characteristic. A partial ana- 
lysis (with respect to the orbital quantum numbers) shows consider- 
able differences in the details of the charge distribution (cf. Fig. 3.2). 

The same effect may be taken into account in a somewhat different 
manner, viz. by first orthogonalizing the atomic wave functions with 
respect to each other. This scheme was proposed by Löwdin [12] 
and used in [286, 287] for calculating a crystal potential. In this 
approach one actually constructs the Wannier functions, that is, 
the approximate electronic wave functions are found prior to the 
band calculation. 

To sum up, when a crystal is formed, the effect of excitation leads 
to d functions becoming either more diffuse or more shrunk, depend- 
ing on whether the atom has its d shell unfilled or, respectively, 
filled. 

Interestingly, the excited-atom model is even capable of explain- 
ing effects that have little or nothing to do with ordinary band 
theory. One example concerns the way transition metals catalyze 
heterogeneous reactions. It is known [288] that copper sometimes 
acts as a metal with an unfilled d shell. This cannot be given a regu- 
lar explanation but it is easily interpreted in the excitation scheme 
because the excitation of an atom is only possible when accompanied 
by a partial depopulation of the d shell. 


* An analogous effect is reported in [285] for V, Cr, Nb, and Mo. 
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3.4.8. Additive screening in the MT model. We saw that the exci- 
tation of an atom is related to its being screened in the crystal. We 
will focus now on the mechanism by. which the additive screening in 
the above cellular model is determined. 

The electron density in a crystal o (r) consists of two contribu- 
tions: the atomic density ot (r) (alias the free-atom density) and 
the screening density Ap (r) which is made up of the density “tails” 
from neighboring atoms. A crystal density is usually calculated 


er? Ap 
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Fig. 3.8. Electron density for the dts? chromium atom, non-self{-consistent cal- 

culation [281]. Left—a free atom (dashed line) and an atom in the crystal after 

the superposition of neighboring atomic densities (bold line). Right—the cont- 
ribution from the neighboring atoms. 


from self-consistent atomic wave functions. This means that in 
computing the atomic density contribution, 0*t (r), the perturbation 
expansion is carried to all orders (cf. Secs. 3.4 and 3.3) instead of 
being truncated after the first-order term as in the dielectric screen- 
ing case. (We mean not the core electrons but those valence 
electrons that are concentrated within the Wigner-Seitz cell.) 

The screening MT density Ao also contains information about the 
valence shell configuration and is therefore a more suitable approxi- 
mation for screening than the uniform electron background in the 
dielectric screening theory. Fig. 3.8 shows these contributions to the 
density [281]. The contribution from neighboring atoms Ao (r) 
increases with r. In the outer region, near r = Rr, it is greater 
than 94 (r), which means that it is indeed “screening”. Can we ex- 
pect that if Ap (r) is replaced by a uniform distribution (as for 
(3.67)) the two screening theories will be equivalent in this respect? 

The answer is negative because given additive MT screening the 
uniform background is only formed by an external addition to the 
density rather than by the whole of it, as is the case in the dielec- 
tric formalism. The “internal” contribution to the electron density 
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is obtained self-consistently and the d-electron density remains atom- 
like which fact is ignored in pseudopotential dielectric screening. The 
electron-gas density from the rest of the atoms contained within 
the Wigner-Seitz sphere is relatively low. To calculate it, the atomic 
density ot (r) should be integrated from A, to infinity; this gives 
the electronic charge outside the free-atom atomic sphere: 


Z,= 40 | oat (r) r2 dr. 


Ra 


This integral can be calculated not only for the total density but 
also for its partial components; this will give the number of s, p 
and d electrons outside the free-atom Wigner-Seitz sphere. 

This sort of calculation presents no difficulties. Assuming a pro- 
gram of the type described in [101], we estimate the computer time 
needed for each element to be no more than 5 to 10 minutes on an 
IBM-360/40 (or similar) machine. Results of this kind (for Hartree- 
Fock densities) are given for 3d and 4d transition metals in [289]. In 
both cases the d charge outside the atomic sphere decreases along a 
period but since the number of atomic electrons increases with atom- 
ic number, the total outside d charge reaches a maximum in the 
middle of the period, decreasing again to the end of it. The percent- 
age of this charge ranges from 0.14 (Sc) to 0.60 (Mo). For the s elec- 
trons the range is only from 0.60 to 0.75. The total charge outside 
the atomic sphere is minimum in the noble metals (0.85) and maxi- 
mum in the middle of the 4d period (1.35). 

These data indicate that even though d electrons do contribute to 
the “free” electron gas in a crystal, they cannot be regarded as com- 
pletely free. The d electron density is, in a way, frozen inside the 
cell and is mainly influenced there by its own atom. 

Additive MT screening thus automatically takes into account that 
d electrons are neither localized nor actually free. Furthermore, 
since the atomic radius is determined by the crystal structure and 
so is the summation in (3.78) it turns out that a single-site MT po- 
tential depends on the arrangement of atoms whereas in the dielec- 
tric screening this dependence is lost. The MT model is thus clos- 
er to reality. It is interesting that a potential optimized within 
the dielectric formalism with respect to the electroneutrality of the 
Wigner-Seitz cell (see back to Secs. 3.2.5 and 3.3.4) also depends 
on the environment, namely, through the atomic radius. A nonopti- 
mized pseudopotential has no such dependence. 

The above comparison between the dielectric and additive screen- 
ing mechanisms may prove helpful for improving the pseudopoten- 
tial screening theory. Firstly, it is clear that a pseudopotential may 
be greatly improved if in optimizing it the crystalline environment 
is taken into account. Secondly, it is clear that transition-metal d 
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electrons may not be considered as completely free and some effective 
valency should be introduced to account for their being partially 
frozen. Perhaps the best choice for this valency would be the num- 
ber of the s electrons plus the number of thed electrons that happen 
to be outside the atomic sphere. This effective valency can be used 
both in calculating the ionic pseudopotentials and in their subse- 
quent screening with the dielectric function e (q) in which the ef- 
fective valency is involved. 


3.4.9. Sereening in alloys. Let us now compare the dielectric and 
additive screening mechanisms as applied to the theory of alloys. 

For simplicity, we consider a stoichiometrically ordered alloy of 
two components A and B each with different electronegativities 6. 
Let €, >€,. The levels in the A atom are thus lower than those 
in the B atom, which means that the A wave functions are rela- 
tively more compact and the atom itself is smaller. The B atom is 
thus more “metallic” and one would expect a charge flow from the B 
to the A atoms. 

In the dielectric formalism atoms are screened independently of 
each other by formula (3.32) and, as we saw in Sec. 3.3, charge may 
even flow, paradoxically, from a non-metallic atom to metallic one. 

This may not happen with additive screening. In a pure crystal 
made up of atoms of one sort, say B, all the charge that has been 
outside the atomic sphere of the atom B “is given back” to it by its 
neighbors. In an alloy, each B atom is at least partially surrounded 
by A atoms, whose wave functions are more localized than those of 
B. Therefore the charge will only partially “return” to the B atom. 
As a result, a Wigner-Seitz cell containing a B atom will be deplet- 
ed of electronic charge and the A atoms will receive more electro- 
nic density than they “gave away”. We thus observe a “flow” of 
charge from metal to nonmetal. 

It thus follows from the way the MT scheme was introduced that 
it is capable of accounting for the differences in the screening of 
chemically different atoms. The flow of charge is included automati- 
cally (even without self-consistency) and simply follows from the 
superposition of densities. The self-consistency process causes an 
additional redistribution of charge and the picture becomes more 
complicated than for pure components. As our calculations reveal, 
the charge is redistributed not only in the MT spheres of the compo- 
nents but also in the intersphere region which suggests that the 
model of charge flow between the A and B “atoms” is oversimplified 
and three regions instead of two (A and B) must actually be consid- 
ered. Of course, the MT model of superposed densities also takes 
this into account to a first approximation. 

Unfortunately, the MT model for an alloy has a number of serious 
disadvantages. Because of the different chemistries of the compo- 
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nents, the alloy’s crystal potential may be rather anisotropic; this 
possibility is artificially excluded out of the MT model. Moreover, 
the MT radii of the components are chosen very arbitrarily. 

Note that in the dielectric formalism some potential anisotropy 
is taken into account through the structure factor S (q). It would be 
desirable to combine the advantages of the MT model (the direct 
summation of atomic densities) with the anisotropy of the crystal 
potential. This problem was considered in [290] but it would carry 
us too far afield to discuss it. 


3.0. Average value of the screened potential 


3.5.1. The long-wavelength limit of the formfactor. Depending on 
the screening mechanism adopted we may come to different screened 
potentials and we would like to be able to compare them. One way 
of doing this is by using the average value of the screened potential: 


W = = \ W (r) dr. (3.88) 


In this expression the integration is over all space but the normal- 
ization factor is taken to be Q,, the volume per atom (the volume 
of the crystal being too large a quantity to be meaningfully used). 

If the potential is screened by way of the dielectric formalism, 
there is no need to calculate W in coordinate space to find W. This 
may be seen by noting that the long-wavelength limit for the form- 


factor of a screened potential coincides with W: 


W (q—> 0) => lim \ W (r) eia d3r = TV. (3.89) 


q>0 


Therefore the theory of linear dielectric screening predicts (by 
3.42) and (3.89)) equal average values for all potentials: 
= py ion (q) 


Į er = lim 
q0 E9) 


=Z Eh. (3.90) 


Since other screening mechanisms are also possible, the above 
expression needs generalizing. We do this by rewriting the form- 
factor of a local screened potential in the form 


Wer (q) = = \ e-ilk+a)r Wer (r) ekt qèr, 
The following argument applies equally well to a nonlocal pseudo- 


potential although its average value and the long-wavelength limit 
of the formfactor will be related to each other in a more complicated 
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way. Making use of (2.32), it can be shown that 


(kalek) = Dy În (ka, P) hy (k, r) We" (r) rdr. 


e 





L 0 

In the Fermi-sphere approximation |k + q | = |k | = kp. Using 
(2.153), we obtain 
{k+q|We'|k) = —# ip 2 (21 + 1) tan "Pb (kp) Pi (cos 9x 444)- 


(3.91) 
In the small-g limit we have 


(kW ky = — = ie 2 (21+ 1) tan nf (kp). 


It is easy to prove the identity 


1 — (EẸ)3/2 
Qo 3z 
Using the smallness of the phase and the definition of the Friedel 
sum, (2.91), we arrive at the following generalization of (3.90): 


lim (k -+ q|W°r]k) = —2 Eps. (3.92) 


3.5.2. Linear-screened potential. For Eqs. (3.90) and (3.92) to be 
consistent, it is necessary that a linearly screened potential (LSP) 
obey the equation: 

GF USP Z, (3.93) 


In other words, the Friedel sum for a crystal potential given by the 
linear (dielectric) screening method is equal to the ion valency. This 
is consistent with the NFE model: an isolated ion, stripped of its Z 
valence electrons and immersed in an electron gas retrieves its 
electrons from the gas. 

The first-order perturbation theory result for a crystal Fermi ener- 
gy is 


EF = hE kwek) = ER (1-22). (3.94) 


From (3.93) and (3.94) we obtain in the LSP model (291, 292]: 
EY = E}. (3.95) 


This result is easy to interpret: the free electron band is brought down 
as a whole by the amount equal to the average value of the crystal 
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Fig. 3.9. The shift of an NFE band by the average value of the crystal poten- 
tial— (a) linear screening potential, (b) minimum perturbation potential. 


The LSP model is thus described by Eqs. (3.90), (3.93) and (3.95); 
the total number of electrons scattered by this potential is Z. 


3.5.3. Minimum perturbation potential. If the valency of an ion 
immersed in an electron gas is Z* < Z, it is intuitively clear that 
the perturbation caused by the ion will be weaker than that caused 
by an ion with valency Z. This may be seen from Eq. (3.92). Since 
the Friedel sum is Z*, the ratio > /Z is less than unity, i.e., the magni- 
tude of the average value of the screened pseudopotential is less. 

It is clear that the minimum perturbation will come from a pseudo- 
potential with a zero Friedel sum 


F (EF)=0, (3.96) 


i.e., with the zero average crystal potential. From (3.92) and (3.96) 
it follows that 


lim (k + q|Wer|k) =0. (3.97) 
q~>0 
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In this case the Fermi energy of the crystal equals that of the free 
electron gas (see Fig. 3.90): 


Y = Ep. (3.98) 


The physical meaning of such a pseudopotential is simple: tho 
entity we have placed in the electron gas is a neutral atom and not 
an ion. The neutral atom already has all the electrons needed to 
screen the nucleus and so will not borrow them from its surroundings. 

For scattering on this potential the phase shifts may be either 
negative or positive, that is, electrons may either be repelled or 
attracted by it. The total number of electrons displaced by the po- 
tential is, by (3.96), zero which means that, on the average the po- 
tential described by (38.96)-(3.98) does not cause a perturbation in 
the electron gas. Hence its name, the minimum perturbation poten- 
tial (MPP) [291-293]. 

Note that in the dielectric screening scheme there can be no MPP 
for a Heine-Abarenkov type pseudopotential (see Sec. 3.2). The reason 
is that at small q the dielectric function behaves like g~°? so that 
(3.97) can only be satisfied if the formfactor of a non-screened poten- 
tial decreases as g~(2+®, ô being positive. But, ascan easily be seen 
from (3.44), for a Heine-Abarenkov pseudopotential 6 = 0, that is, 
in dielectric screening theory any such potential is always a line- 
arly screened one. 

In the additive scheme, an MPP is even possible for a Heine-Aba- 
renkov pseudopotential. For example, by screening this latter with 
(3.67), we obtain from (3.69) that the well depth of the pseudopoten- 
tial optimized “to the minimum perturbation” is 


Aye =~ As. (3.99) 


A pseudoatom corresponding to such a pseudopotential will be an 
MT and MP potential simultaneously. 

As pointed out earlier, an MT potential is not byany meansa min- 
imum perturbation one. An MPP is a scatterer that has all the “nec- 
essary” electrons, i.e. has a zero Friedel sum. There is no require- 
ment that all these electrons be concentrated in the Wigner-Seitz 
cell. In contrast to the electron density of the MT potential, the 
MPP electron density may extend to infinity because of its Friedel 
oscillations, which are absent, by definition, both in the MT potential 
itself and the charge density around it. 

A word should be said about the convergence of the perturbation 
theory series. For a pseudopotential having no bound states, we 
saw in Sec. 2.4 that by (2.118) the convergence depends on the aver- 
age value of the pseudopotential. It is clear that an MPP will ensure 
better perturbation series convergence than will an LSP. It is possible 
that optimization to minimum perturbation is stronger than that to 
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electroneutrality. The additive screening theory must then have 
certain advantages over dielectric screening. Formula (3.99) is a 
special case of such an optimized pseudopotential. 

Certainly, the minimum perturbation criterion is not the final 
word in the analysis of perturbation theory convergence. We saw 
in Sec. 3.2 that in this analysis the possibility should be taken into 
account that the crystal potential made up of single-site potentials 
may be weaker than each of the single-site potentials taken alone. 
It would be wrong, therefore, to argue that from the viewpoint of 
convergence an LSP will always be inferior to an MPP. 


3.5.4. Friedel sums. The next question to be discussed is the mean- 
ing of a Friedel sum. 

The Friedel sum for a given potential may be viewed as a condi- 
tion determining the Fermi energy of a crystal in which the poten- 
tial will act as an ion having a given valency Z’. To see this, let us 
demand that the number of electrons needed for screening the ion 
(i.e., the Friedel sum F) be Z’: 


F (EF) =Z'. (3.100) 


The scattering phase shifts for a given potential and the corre- 
sponding Friedel sums should be calculated for different energies, 
starting from Æ = 0 and then moving to increasingly higher ener- 
gies. The energy satisfying (3.100) will be the desired Fermi energy. 

Let us invert the problem. An ion of valency Z is placed in a crys- 
tal whose Fermi energy E¢ is characteristic of another valency. 
What is the Friedel sum? 

We transform Eq. (3.94) to 

GE (ES) —_4 
F (Er) 2/3Er 





(EY — Ep). (3.101) 


From Eqs. (3.101), (3.93) and (3.96) it follows that a given value 
of EF specifies a potential-screening model. If E°p = E%, Eq. (3.101) 
gives F (EF) = 0, that is, the potentials in the crystal have been 
chosen in such a way that their average values are zero. If EF = 
(1/3) E%, then ¥ (EF) = Z showing that the average values 
satisfy the rules of linear dielectric screening. 

It follows that specifying the Fermi energy of the crystal fixes the 
position of the free-electron band bottom relative to the vacuum 
zero: the lower E‘r, the deeper the band bottom (cf. Fig. 3.9). But 


this position is determined by the average value of the crystal po- 
tential, which determines the background, as it were, against which 
all the scattering processes evolve. Depending on the properties of 
the background, the pseudopotential may appear as either an ion 
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of valency Z or a neutral atom. It makes no sense to ask for the value 
of the Friedel sum unless the characteristics of the background are 
known. 

We thus come to the extremely important concept, namely, the 
effective medium surrounding the scatterer. 

In the theory of dielectric linear screening, the effective medium 
is simply a Sommerfeld gas. In the construction of an MPP (i.e., 
in the additive screening scheme) characteristics of the effective- 
medium have already been optimized somehow. In Secs. 4.4 and 5.2 
we shall see how an effective medium may be introduced in band- 
structure calculations. The concept of effective medium is employed 
in the coherent potential approximation in the theory of alloys. The 
scattering properties of a pseudopotential depend on the properties 
of the effective medium. When choosing a pseudopotential it is 
necessary to take account of the effective medium it will be 
placed in. 

Expression (3.101) can also be derived in a somewhat different 
manner. The Friedel sum represents the number of electrons with- 
drawn, as it were, by the scatterer from the stream of free electrons. 
Let Z be the number of electrons (per unit volume) in acrystal with- 
out the scatterer and Zer be the effective number of free electrons 
after the scatterer has been introduced. Assuming the equipotential 
surfaces in the crystal are spherical, we can write 


Q 








Z=: = (B3)3/2, Zor = =o. (ESE)802 
which gives 
F (EY) = Z (1 — (EF Eh)3??), (3.102) 


This expression is due to Lee [294]. It shows that at EF = Eþ, 
„F = Q as in the linear formula (3.101) so that for an MPP the two 
approaches agree with each other. With an LSP, however, they do 
not: by (3.102), Z = Z only for Ly = 0 and not for EF = (1/3) E. 
If, formally, we assume ,¥/Z to be small, then 

mcr po F 2/3 7 2 F 

ne = Ep (1 — Z ) ~ BR Z ), 
which leads to Eq. (3.101) [294]. Which of (3.101) or (3.102) is more 
correct may only be shown by an experiment (note though that a 
direct comparison is impossible). The choice may be essential if the 
formulae are considered as optimization criteria. 








3.5.9. Charge flow in alloys. We can apply Eq. (3.92) to the study 
of charge flow in alloys. From (3.92) we have 


Wan — — 2 ppa = , (3.103) 





where Z and £%*"' are given by (3.60) and (3.61) respectively. 


9— 01063 
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Using the smallness of Born approximation scattering phases, 
the Friedel sum way be rewritten as follows: 


_ 2 B 
FM we = > (21-41) tan yy 


J 


— he E S (21414) | wen (r) ji (kpr) r? dr 
li 


=c4F A" t+enF p. (3.104) 


Here we have introduced Friedel sums for the screened alloy pseu- 


dopotentials: 
A(B),all 


x 2 

FF ACB) =F >) (21+ 1) m2 : 
; (3.105) 

tan nf Pa- — ke \ Wey (r) 72 (ker) r? dr. 


The above Friedel sums differ from those of the pure components in 
that the dielectric function used for screening the pseudopotential 
Wah in (8.105) is that of the alloy rather than that of a pure com- 
ponent. It may be said that F (F3) correspond to the charge 
possessed by the A (B) pseudoatom in the alloy. Thus, the charge 
flow is given by 


AQ= FP — FP. (3.106) 
By the same argument, 


— gall 

m= — A- Ee “am . (3.107) 
Multiplying (3.107) by c4 (€p) and adding the results together we 
come to Eq. (3.103), which means that Eqs. (3.103)-(3.107) are con- 
sistent with each other. Using Eqs. (3.106) and (3.107) we find the 
difference between the average values of the screened crystal pseu- 
dopotentials: i 


We Wy = ETAL, (3.108) 


We have thus proved a theorem relating the difference between the 
average values of a crystal pseudopotential to the magnitude of 
the charge flow. 

When applied to screen an impurity potential in the atom’s sphere 
model (3.108) indicates that the potential is shifted as a whole by 





_ 2 / 9n\2/3 AQ 
AU,= + (-7) eae (3.109) 
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The result would be different had we distributed the charge AQ 
uniformly over the surface of a sphere of radius R, and then applied 
the Gauss theorem. 

The impurity potential shift as a means to include screening has 
been used in order to calculate the physical properties of aluminium 
and copper [295-300] with low impurity concentrations. In these 
investigations the MT potential of the impurity was shifted up or 
down until the Friedel sum computed for the MT sphere coincided 
with the valency difference between the host and impurity atoms. It 
was assumed both that the impurity atoms must retrieve their va- 
lence electrons and that all the electrons would be contained in the 
MT sphere, that is, there would be no electrons between the surfaces 
of the MT and the atomic spheres. Nevertheless, the results turned 
out to be in good qualitative (and often quantitative) agreement with 
experiment, and this even for transition metal impurities. A possible 
explanation is that the requirement for the Friedel sum to correspond 
to valency is a very strong optimization condition for an impurity 
potential. To a certain degree, formulae (3.108) and (3.109) are a 
justification for the above procedure of shifting the impurity poten- 
tial as a whole. 

Note that although the above optimization to the Wigner-Seitz 
cell “non-neutrality” has emerged within pseudopotential theory, it 
is not actually used in pseudopotential alloy calculations. 


Chapter 4 


Theory of pseudopotential 
formfactors 


41. Nonlocality, the energy dependence 
of formfactors and perturbation theory 


The topic of this section is the dependence of formfactors on two 
main variables, the energy Æ and the momentum transfer vector q. 
These are not entirely separable in this context and must therefore 
be considered together. 


4.1.1. q-Dependence of formfactor. A pseudopotential W (r) is 
usually called nonlocal if its formfactor (k’ |W |k) depends both 
on the vector difference k’ —k (i.c., on the momentum transfer 
vector q = k’ — k) and on the vector k itself. Nonlocality arises 
when the pseudopotential, unlike the original crystal potential 
V (r), is not a multiplication operator. In other words, a pseudopo- 
tential is nonlocal if it does not commute with exp (ikr). 

This may be seen by noting that for a local pseudopotential W 


(k’ IWIk) = | eH (r) eitr qèr 
=f È W (r) eth -we dsr =W (k' —k). (4.1) 
0 


For a spherically symmetrical pseudopotential, the formfactor 
depends solely on |k’ —k | and not on the angles between the 
vector k’ — k and the coordinate axes. 

For a nonlocal pseudopotential involving, for example, projec- 
tion operators, we have 


ak’ | D wo) Pi] ky 
l 


=- | ece SW) ji (kr) Yz (k) Y; (r) d?r 
l 


= S (2144) [ | ar wien (r) r2dr] P,(cos eye). (4.2) 
i 


0 


This formfactor depends on both the vectors k and k’. It can be 
shown from Eq. (2.34) that the formfactor also depends ong = k’ — k 
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and k. Either way, there are two vectors the formfactor depends on. 

A nonlocal formfactor is extremely inconvenient to work with 
because for any given pair of k and q the functions that must be cal- 
culated are rather complicated. To avoid this, the formfactor’s de- 
pendence on k and q is usually simplified. To see the meaning of 
these simplifications it is worthwhile examining the dependence 
W (a). 

Consider the scattering-theory simple-metal formfactor (2.172). 
Since V (r) is a multiplication operator, the first term is local. The 
q-dependence in the second term is determined by the factors (k + 
q |a), which require the explicit form of core orbitals ¢, to be 
evaluated. Taking Slater orbitals [14] 


bq =N a exp (—ar) (4.3) 


and employing the same device used to evaluate (1.25) we have 








(k +- q|a) = Na È o- ik+ore-ar gsp 
Q d 
æ 1 
= “ne | | ee il k+a1 7008072 dr d (cos ©) 
0 cf at 
0 — 
= — 8na N ,Q5* [(k + q)? + a?]-?. (4.4) 


Thus the second term in (2.172) falls off as y~4 whereas the origi- 
nal-potential formfactor does not decrease faster than q~*. Then, the 
simple-metal formfactor as a whole falls off like q™. 

Turning now to the formfactor of a transition metal, (2.170), we 
observe that its q dependence comes through the factors (k + 
q|V ja). These can be estimated by assuming that V (r) is a 
purely Coulomb potential. By (1.25) we find 


(k + q|V |œ) = — FN \ e` ik+q)r 1 e-r q3r 
Qo r 
= —41Ze?N y [(k + q)? -+ ayt, (4.5) 


that is, the “compensatory” addition to the potential (the second 
term in (2.170)) decreases as g~* and hence slower than that for the 
simple metals. As a whole, however, the transition metal formfac- 
tor decreases like q~*, that is, exactly like its simple-metal coun- 
terpart. 

We can see from (4.4) and (4.5) how the formfactor depends on 
Ok, the angle between k and k’ = k + q (see Fig. 4.1). 

Consider the two limiting cases Ok kx = 0 and Ox, = x for for- 
ward and backward scattering respectively. It can be seen that while 
for forward scattering the quantity [(k -+ q}? + a?]~ decreases mo- 
notonically with |q |, for backward scattering it has a maximum at 
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q = —k, i.e., it first increases with | q |. It follows that in the back- 
ward scattering case the second terms in (2.170) and (2.172) increase 
with |q |, so that at small |q | the compensation of the original 
potential is better than it is 
for forward scattering. The point 
Qo Where the terms in (2.170) 
and (2.172) cancel out, is deter- 
mined by 


(k+q)|W\k)-=0. (4.6) 


It is clear that for backward 
k scattering qə is nearer to the 
origin than it is for forward scat- 
Fig. 4.4. Relative positions of the tering. The backward scattering 
vectors k, q, and k + q. formfactor may thus be thought 
of as being closer to the origin 

than the forward scattering formfactor (see Fig. 4.2). 
Note that when calculating the band-structure energy (1.36) the 
backward scattering formfactor is more important than the forward 
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Fig. 4.2. Forward- and backward-scattering formfactors of Al. From Harrison 
73). 


scattering one. The reason is that the relevant second order pertur- 
bation theory denominators in (1.31) are small for backward scat- 
tering. In particular, the Bragg reflection condition corresponds to 
backward scattering, which means that singularities in (1.31) occur 
at momenta given by q = —2k,. The specific value the formfactor 
assumes at a Brillouin zone face (i.e., the value of the backward 
scattering formfactor) determines the applicability of the Rayleigh- 
Schrodinger expansion (see Chapter 1), the magnitude of the cohe- 
sion energy, and whether the crystal will be a metal or a dielectric. 
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The forward scattering formfactor is also important, and is used, 
for example, when the total scattering cross-section for a given atom 
is determined by the optics theorem [7, 39]. 

Since a rigorous local formfactor is impossible to obtain, artificial 
methods must be employed to eliminate the non-locality. For exam- 
ple, a quasi-local formfactor can be approximated for backward 





(a) 


Fig. 4.3. Illustration to the Fermi-sphere approximation. 


scattering or for some other (than x) empirically chosen scattering 
angle. This approximation is rarely seen, however. The most widely 
used is the following one [13, 73, 17]. 

It is assumed that the Fermi-energy electrons play the dominant 
role in a crystal. Accordingly, the wave vector k in the formfactor 
is taken to be kp. It is further assumed that at small | q | the magni- 
tude of the wave vector is conserved in a scattering process and it is 
only the direction of the vector k’ = k + q that changes (k’, as it 
were, “turns away” from k, see Fig. 4.3a). When k’ is antiparallel to 
the original vector k, there is no word of further “turning away” and 
what the vector k’ is left to do is to increase its magnitude at a fixed 
scattering angle. Clearly we are dealing here with backward scat- 
tering (Fig. 4.30). 

Mathematically, this approximation may be written as 


q S 2kp: |kp+q| = [kp], cos Ok kg = 1—0.5 (G/F p), 
q È 2kp: |Kr4-q| = (q| — Ikr, cos Ok k+ = — 1. 


This scheme, usually called the “Fermi sphere” or quasi-local ap- 
proximation, can be applied to any formfactor. 


(4.7) 


4.1.2. The linear E-dependence of the formfactor. We now turn 
our attention to how the formfactor depends on energy. 

We saw in Chapter 2 that a pseudopotential that replaces a true 
potential is energy-dependent. This, of course, causes some difficulty 
when calculating the total energy by (1.36). To obtain this quantity 
we must add together the energies of all the occupicd states. This is 
easily done if the energies are related analytically to the quantum 
numbers of their respective states or, in other words, if the explicit 
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form of the dispersion law Æ (k) is known. If, however, the pseudo- 
potential formfactor is energy-dependent, expression (1.31) becomes 
a nonlinear equation for Æ, the explicit form of the dispersion law 
is lost and the right-hand side of (1.31) cannot be integrated over k. 

A similar difficulty was encountered in Chapter 1 in the discussion 
of the NFE model. We saw there that a more correct expression for 
the dispersion law is given by (1.30) with energy-dependent de- 
nominators. Now it turns out that the numerators are energy-de- 
pendent too! 

It can be seen from (2.170) and (2.171) for the scattering theory 
formfactor that the formfactor may depend on energy either linearly 
or it may have poles. Since a zero in the denominator in (2.170) is 
also a pole singularity, it is appropriate that the singularities in 
(1.30) be considered together. 

We start with simple metals and hence with formfactors linear in 
energy. In a simple metal the Fermi surface does not touch a Bril- 
louin zone face, so there is no fear of singularities at k; = e, and 
the Rayleigh-Schrodinger perturbation theory is applicable. 

It is worthwhile noting here that in the case of simple metals the 
following way of reasoning is possible [73]. Energy Æ enters the 
formfactor in combination with large quantities Ea, which are of 
the order of tens of rydbergs, whereas in the conduction band Æ 
is typically no more than half a rydberg. But then even a rough 
approximation for Æ (k) will do because against the background of 
E, it will not introduce any perceptible error. Let us discuss the 
validity of this argument. 

Consider the second term in (2.172). The following transformation 
is identical (cf. (2.162)): 


Ea (a |k) = (a| Ea |k) = ((k| — V? + V(r) a ))* 
= (æ@|— V? +V (r)|k) = k? (ajk) + (a|F |k). (4.8) 
When substituted in (2.172) this yields 
(k -+ q| W |k) = (k + q| V |k) 
~2 [(k?— E) (k + qla) (a@|k) -+ (k +qla) (el V |k)]. (4.9) 


Within the ion core (i.e., for r < R,, here R, is the core radius) 
the wave functions P, form a complete set. Hence 


>) Dy (r) DS (x)= 6 (r—r') O(R, —r) O (R, —r'), 


where O (R, — r) is the Heaviside step function (1 forr < R, and 0 
for r > R,), which takes care of the fact that for r > R, the core 
functions vanish. 
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As a result, we get (noting that 1 — © (x) = O (—z)) 


(k-+ q|W\k) = k+q|V| (r) © (r—R,)|k) 


4nRe 
Qj 





(x? — pR, (4.10 


The formfactor thus contains two contributions: (i) the integral 
of the potential over the outside region (where the potential is rel-. 
atively low) and (ii) the oscillatory addition term whose amplitude 
is determined by the departure of # from k? (rather than from £,). 
The quantities Æ and k? are of the same order of magnitude and small. 
errors in the energy are therefore not as harmless as might at first. 
seem. 

Note that the first term in (4.10) corresponds to the potential 
with the core region removed. 

It follows from (2.29) that at large q the second term in (4.10): 
falls off like q™. 

Note also that the formfactor (4.10) is nearly local at least in the- 
sense that it does not depend on the direction of the vector k (the 
energy dependence still remains). Let us assume a first-order dis- 
persion law 


EQ (k) = k? + (k| Wk) (4.11): 


with the same pseudopotential as used in (4.10). Substituting (4.11): 
in (4.10) we find after a little manipulation that 


(ke + q|W k) EEO Re) (4.12) 
1— AnRe ji (qRe) 
Qo q 


We have derived a Hermitian local energy-independent formfactor’ 
and all our problems thus seem to have been rigorously solved. Or- 
have thev? 

Not eutirely, because we made a very important approximation, 
namely, that the core wave functions are well localized. This is. 
true for simple metals but it is not true, for example, for the noble 
metals where even free-atom d electron wave functions are poorly 
localized (equation (2.172) does not apply at all to the trausition 
metals). A direct calculation (see Sec. 3.4.8) shows that the d elec- 
trons are not localized in the core region, so much so that some 2° 
to 2.5% of the atomic d electrons (that is, 0.2-0.25 of an electron). 
lie even outside a Wigner-Scitz sphere drawn around a noble metal 
atom. The localization boundary for the d electrons is thus very 
blurred. This suggests that we need two core radii for the noble 
metals, one for the ordinary core and the other for the d electron 
core. We might then hope to obtain a local energy-independent pseu-- 
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‘dopotential of the type (4.12) for these metals. Local model pseudo- 
potentials have indeed been constructed for the noble metals. 

Were the original potential taken (in the spirit of the NFE model) 
in a purely Coulomb form, i.e., —Ze?/r (Z being the valency), equa- 
tion (4.12) would describe a one-parameter model pseudopotential 
that ought to be called a modified Ashcroft pseudopotential* be- 
-cause its formfactor is 


AnZe2 cos (¢R-) £ 

k Wik) =— 08 ate) 4.13 

(k -+ q|W|k) ~ Qoq? E 4uRe j1(qRe) l ) 
Qo q 


Were the energy dependence of the formfactor approximated by a 
zeroth (instead of a first) order of perturbation theory, the second 
term in (4.10) would go to zero and the denominator in (4.13) would 
be unity. The formfactor would thus become a simpler function 
-of q and (4.13) would represent the formfactor of the ordinary Ash- 
-croft pseudopotential. 

This simple example thus demonstrates that modeling the Æ de- 
pendence of the formfactor may modify its dependence on q. This 
modification would be most significant for small q, which are very 
important in the study of disordered systems. 

We have thus considered the modeling of the formfactor’s energy 
‘dependence. Note that with simple metals there is no need to elim- 
inate this energy dependence, provided it is linear and the disper- 
sion law is accurate to second order. This may be seen by noting 
that the formfactor (2.172) can be rewritten as 


W (q) = a(q)+ £b (q), (4.14) 


‘where the functions a and b are E-independent. Substituting (4.14) 
in (1.31) yields a quadratic equation for Æ, whose solution is 


— 1-b (0)— B 1 r 
E = HOE V8 +BY +e (+a (0) —4), 
(4.15) 
~where 
la (gn) |? = NV a@(gn) b* (gn) t- compl. conjugate 
A= 2 en ET ene Be 
|b (gn) |? 
C- > oe (4.16) 


n 
The integration of (4.15) over k is only slightly more complicated 
‘computationally than integrating the original expression (1.31), 
while the accuracy may be greatly improved. 


* The ordinary Ashcroft pseudopotential is discussed in Sec. 3.2.5. Its 
‘formfactor is obtained from (3.44) by setting A = 0. 
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Equation (4.15) has the same form as the solution for the two-band 
model discussed in Chapter 1 (Eq. (1.14)). However, while in that 
model the dispersion-law “splitting” occurred about the “average lev- 
el” (k? + (k + g,)*)/2, in the present case the k? term is under a 
radical and the average level is (—1 + b (0) — B)/(2C). In this 
expression, it is impossible to separate out a parabolic k dependence 
which is characteristic of the nearly free electron model. 

Equation (4.15) is now reminiscent of the dispersion law splitting 
that occurs in the LCAO model (devised for narrow bands). The in- 
clusion of the formfactor’s energy dependence thus makes an NFE 
model look like an LCAO model and the stronger the dependence, 
the more so. In the limiting case of a resonant Æ dependence, nar- 
row bands of the LCAO type will appear. 

The effect of the formfactor’s energy dependence on the band width 
may also be seen from (2.172). 1t follows from (4.4) and (4.5) that 
the sum of the orthogonalization coefficients must be positive, that 
is, relative to the attractive potential, the term SEa k+qalaæ) x 


Qa 

(œ |k) is compensatory in that if added to the original poten- 
tial’s formfactor it decreases this latter. If Æ is approximated by its 
maximum possible value, the crystal’s Fermi energy Ef, it is clear 
that there will be too much compensation and so the formfactor will 
be underestimated. In the NFE model reducing the perturbation 
makes the allowed bands wider while increasing it (i.e., using energies 
lower than EẸ) narrows the bands, as we would expect. 

To sum up, an energy-independent formfactor (in the “Fermi sur- 
face” approximation) must lead to wider bands than those generated 
with an energy-dependent formfactor with different contributions 
for different conduction band states. 


4.1.3. The “repelled” solutions. The next topic to be discussed is 
singular energy dependence. We start from the zero denominators 
of second-order perturbation theory. We rewrite (1.30) to read 


Bn 
En— E ’ 





E = EO (k)— >, 


where EH is defined by (4.11) and 
B, = Ik 4- gn |W |k)|?. 


(4.17) 


We designate the (as yet) unknown solution of (4.17) as Ex. Suppose 
Ex is near €m, then the dominant term in (4.17) is the one with 
n = m and so we have 


EQ)-Lem + V En EO) F 4B, 
By =O emt V em EDE Bm, ; (4.18) 
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If the pseudopotential is such that EM = 8m, then E, = 8m + 
| B,, | showing that we meet again the effect of removing degener- 
ation discussed in the Introduction, Section 5, in terms of the two- 
band secular equation. Equation (4.17) corresponds to many-band 
model, every nth term in (4.17) gives a correction term to Æ, thus 
preventing it from coinciding with e,. 

It may be said that perturbation theory pole singularities tend to 
repel the solutions from the energies corresponding to the poles. 





Fig. 4.4. The repulsion effect (cf. Eq. (4.17)). 


Denoting fı (E) = X Bn (E — &,)7 and f, (E) = E — E®, Eq. 


n 

(4.17) may be rewritten as f} (E) = f, (E). This equation is graphi- 
cally solved in Fig. 4.4. The band solutions for Æ are given by the 
intersections of the curves f, (£) and fa (E) and are marked by points. 
Three different initial approximations are shown. (They are not 
chosen arbitrarily but correspond to different potentials and different 
points of k space.) The arrows show how much the band solutions 
have been repelled from the corresponding first approximation 
value #W), 

The repulsion effect becomes stronger as the corresponding matrix 
element increases, and tends to put E, halfway between the poles 
corresponding to the energies €m and €,,4,. The nearer E@ is to a 
particular €m, the further the solution is repelled from it, the same 
being true of the next value €,,4,. Only when £@) lies halfway be- 
tween two poles are the effects of €, and €,,4, balanced (and kM 
is then a good approximation of Æ). If E@ coincides with E£,, as 
defined by fı (Æ.) = 0, then there is no repulsion and the band so- 
lution coincides with L™, 
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We have thus established that electronic energy levels in a crystal 
are repelled from empty lattice levels. 

The band spectra of a series of metals having the same lattice 
type but different valencies Z (and hence different pseudopotentials) 
should be similar because the levels of the low-Z metals will be 
pushed down to the values characteristic of large Z. and the levels of 
large-Z metals are pushed up to 
the values characteristic of low Z. 
The effect of Z isthus “averaged” 
(as illustrated in Fig. 4.4). 

Note that this similarity may 
also be predicted from group-the- 
oretical considerations. 

The repulsion effect explains 
why band levelsat different points 
of the Brillouin zone have 
different sensitivities to small 
variations in the potential. It 
follows from Fig. 4.4 (see Fig. 4.5) 
that if ÆÐ is equidistant between 
two empty-lattice levels, chang- 
ing off-diagonal pseudopotential 
matrix elements will have little 
effect on the levels whereas for 
an Æ, close to an €m a similar 
change will shift the level sig- 
nificantly. To put it another Fig. 4.5. The sensitivity of band 
way, the solution is repelled solutions Æ to the change in po- 
from levels neighboring ¢, to tential depends on the position of 
levels Æ.. When Æ® coincides rO relative to the energies He and 
with Æ., the solution Æ, is weakly the empty-lattice dispersion Jaws. 
dependent on changes in the po- 
tential. The positions of Æ, are different at different points of k 
space. Therefore a change in the potential at some points will bring 
the energy levels down, while it will force them up at others, and 
in some places the changes will have no effect on the levels. The 
repulsion effect also explains, for example, why energy levels at 
different points have different sensitivities to the exchange parame- 
ter a, the interatomic distance, the temperature, or impurity con- 
centration. 

We thus see that some levels in a band structure are potential- 
sensitive and some are not and that the positions of these latter are 
largely determined by the lattice type. We can in principle predict 
for a given metal, at which k points the levels will be more sensitive 
to the potential. This sort of information is very important when 
analyzing the errors that may occur in fitting a pseudopotential to 
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experimental band-structure data. If, for example, the fitting invol- 
ves the energy of an insensitive level, the error may be quite con- 
siderable. 

Certainly, the repulsion effect is simply a convenient way to illus- 
trate the nature of band-structure formation processes in crystals. 
Regrettably, such a rearrangement of energy spectrum is ignored in 
Rayleigh-Schrodinger perturbation theory. 

Level repulsion also takes place when there is a quasi-bound state 
that produces a singularity in the formfactor (2.170), as is the case 
for the transition metals. The difference is that an interaction with a 
quasi-bound level is a first-order perturbation theory effect, where- 
as an interaction with an empty-lattice level is a second-order one. 

Moreover, since a quasi-bound state produces a single pole, alk 
of the band states will be repelled from this pole. The repulsion will 
not be compensated for, for the simple reason that there is nowhere 
for the compensation to come from. 

The important topic which will be always of relevance in handl- 
ing energy-dependent pseudopotentials is the sign of the energy 
derivative of the formfactor 

diy, +aqlVla) («|V ik 
ae E+ al Wk) = — Yea . (4.19) 
a 

If q tends to zero, the numerator is positive, that is, the right- 

hand side of (4.19) is negative: 


ek t+a\ Wk) <0. (4.20) 


If q increases, inequality (4.20) remains true up to very large q 
values. For a Coulomb potential, it is not violated at all (see (4.5)). 

The sign of the derivative, however academic the topic might 
seem, is of principal importance in pseudopotential theory. 

From the scattering theory point of view, the positive numerator 
in (4.17) corresponds to a positive resonance width for a quasi- 
bound level or, in the context of Eq. (2.77), to the decay state (when 
an electron leaves the atom to enter the free-electron pool) [801]. 

On the other hand, a positive resonance width (negative energy 
derivative) means that after introducing a quasi-bound level into 
a continuum, the level will make the continuum levels move apart 
as shown in Fig. 4.6.* For a negative resonance width (positive ener- 


* The figure shows schematically the interaction (hybridization) between the 
quasi-bound level eg and the free electron band k?. This interaction can be de- 
scribed by a two-band model (see the Introduction, subsection 5) and results 
in a complicated dispersion law. 

The same scheme explains the interaction between s and d bands in the transi- 
tion metals. In this case we have a dispersion law gg (k) instead of the level eg 
and the spectrum has no gap similar to that shown in Fig. 4.6. 
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gy derivative) the continuum levels will merge to the resonance ener- 
gy. This difference can be most easily seen from (4.18), which in 
this case takes the form 
E — Wt Bo + V (k?= Fo)? + 4Bq 
2 


+ ’ 


where B, is the resonance width. 


If B, is positive, |E,—E_|—YV (k®?—E£,)°?+4B,> |k? — E, | 
and the continuum levels are pushed apart. For negative Bg, 
|£,—E_|< |k? — E, | and the levels converge. In the 
range Ea —2V(B,|<k°<£,+2V|8, |, the energy Ex 
acquires a complex addition term 
for negative resonance width, 
which means that the correspond- 
ing states are described by damped 
functions and are therefore 
unstable. From the scattering 
theory point of view a negative 
resonance width corresponds to 
a trap state, for which an elec- 
tron is localized on the atom. In 
the NFE model, the binding of 
atoms into a crystal is due to 
the delocalization of electrons; 
their localization on the atoms is 
tantamount to the collapse of 
the crystal. 

The use of a resonance pseudo- , 
potential with a positive energy Fig. 4.6. The sg level (horizontal 


deri . . dashed line) interacts with the free- 
erivative (negative resonance electron band (parabolic dashed line): 


width) may thus lead to an _ to form the dispersion law shown by 
entirely wrong description of the the bold line. 

band structure and hence all the 

crystal’s properties will be incorrectly modeled. It is for this reason 
that when studying formfactors in the next two sections we shall 
always be concerned with their energy dependence. 

It should be noted that the parameters of the model pseudopoten- 
tials currently in use are also energy-dependent. This dependence is 
described by a resonance formula with a negative resonance width 
but because of the “Fermi sphere’ approximation with E = Ep, 
nothing goes wrong. The sin of fixing the energy thus saves us from 
a much more serious error. 

On the other hand, it is clear that the E = Ep approximation as 
it stands is only valid for simple metals. For the transition metals, 
model pseudopotential theory faces a rather awkward choice: the 
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“Fermi surface” scheme can neither be used nor, less still, abandoned! 
In this connection, the reader should refer to [302] for a pseudopoten- 
tial renormalization procedure which alters the position of a singu- 
larity and thus reduces the effect of the “troublesome” energy de- 
pendence. Í 


4.2. The OPW formfactor 


We now turn to constructing the OPW formfactor, which arises, 
as the name implies, within the orthogonalized plane wave (OPW) 
method. First, a brief historical review. In fact, the pseudopotential 
idea could have been introduced in its present-day form as far back 
as 1940 when the OPW method was proposed as a tool for band 
theory studies [303]. This however was done by Phillips and Klein- 
man only in 1959 [304]. It became clear fairly soon that the OPW 
pseudopotential method was not useful for the transition metals. In 
1965 Ziman [70] modified the Korringa-Kohn-Rostoker (K KR) meth- 
od which is the Green’s function method adapted for band structure 
‘calculations, and in 1967 Heine [305] employed this modification 
(the KK RZ method) to construct the model Hamiltonian (see Sec. 5.1). 
Some time later Hubbard [81, 306, 307] discussed the bas- 
ic principles underlying the model Hamiltonian approach within 
the scattering theory framework. In 1969 Harrison [308] introduced 
‘the transition-metal pseudopotential. That his work is conceptually 
‘close to Hubbard’s is only natural if we remember (see Chapter 2) 
that pseudopotential theory is a special case of scattering theory. 


4.2.1. OPW pseudopotential. The construction of an OPW pseudo- 
‘potential is discussed below following the argument in Hlarrison’s 
paper [308] on transition metals, the simple-metal result will then 
‘emerge as a special case. 

We saw that the transition metals are characterized by having 
‘quasi-discrete states. In formfactor language, see Eq. (2.170), this 
means that the sum over œ has contributions from both bound and 
-quasi-bound states, either contribution introducing its own energy 
dependence. When constructing an OPW pseudopotential, it is 
necessary that the set of trial functions should contain both types 
of state. 

The function ¥k must then be written as 


Y= È Balk +g) + 2) Agia) + È Cald), (4.21) 


where the first term corresponds to the ordinary NFE plane-wave 
expansion and the second and third terms describe the interaction 
with, respectively, the core states | a) and quasi-stationary d states 


q4). 
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To actually perform the expansion, the explicit form of these 
functions is needed. The core orbitals |œ) can be represented by 
free-atom core functions. The crystal potential will move the cor- 
responding levels (the core shift) but leave the wave functions them- 
selves unaltered. In principle, the computational accuracy may be 
improved by taking these functions in Bloch LCAO combinations. 
This is the approach actually employed in OPW band calculations 
using a secular determinant (see [309, 310]). 

Throughout the following, | œ )’s are assumed to be the eigenfunc- 
tions of the crystal Hamiltonian because on forming the crystal the 
core region remains unaltered. The implication is that the core 
functions of neighboring atoms must not overlap. 

As for the | d} functions, we may choose the atomic d orbitals to 
model them but, since transition-metal d electrons are delocalized 
(d bands exist), these functions cannot be considered to be eigen- 
functions of the crystal Hamiltonian because the potential affecting 
the d electrons differs from the atomic potential. To take this differ- 
ence into account, we suppose that the crystal potential Ve may 
be related to the atomic potential by 


Ver (r) = Vat (r) —6V. (4.22) 


It is clear that the effect of the crystal Hamiltonian on the d 
states may always be written 


(— V? +V (r))|d) = (ea — A) |d), (4.23) 


where we have separated out an r-independent parameter eg and 
introduced an operator A. Note that thus far the partition into gg 
and A is quite arbitrary. 

Operating on (4.23) by the function (d | (on the left) and using 
(4.22) gives 


eq = Ea — (d|6V|d) + (d|A|d), (4.24) 


where at is the atomic d level. For the core states | a) the operator 
A is, by definition, zero and they satisfy (4.24) with d replaced by 
a. It is then clear, incidentally, that although the atomic core states 
remain unchanged on forming a crystal, the core levels are dis- 
placed, which fact should be taken into account. 

We assume that the crystal potential is spherically symmetric in 
the localization region of the d functions, i.e., d functions with 
different magnetic quantum numbers do not hybridize, or mathe- 
matically (cf. Sec. 2.2) 


(d’ | — y? 4- yer Id) = €gÔda . (4.25) 
Clearly, (d’ |d) = ga. From (4.23) we have 
(d’|A|d)=0. 


40—01063 
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Now we turn to the pseudopotential itself. By analogy with (1.18) 
we introduce a new function q, often referred to as the “pseudowave” 
function. The true wave function ¥ is given by (4.21) while ọ is 
given by the first term in (4.21). We have 

p= $ Balk + gn) . (4.26) 


ras 
n 


Substituting (4.21) in the Schrödinger equation gives 
(—V?-4 Ve) o+ 2) Aa (Ea — £)\a) 
a 


+2 C4 (&qg—E)|d) — py CzA|d)= Eq. (4.27) 


To evaluate the coefficients A, we multiply (4.27) by (a | to obtain 
Ea (219) + Ag (Ea — E)— 2 Ca(ajA|d) =E (a\9) , 


where the Hermiticity of the Hamiltonian and the strong localiza- 
tion of | œ) have been used in the first term. 

In [308] the operator A was assumed to be constant in the ion core 
region. This is of course an approximation on the shape of the screen- 
ing potential, but it considerably simplifies the formalism and is 
quite in the spirit of the MT scheme (see Secs. 3.4 and 3.5). The 
resulting expression for A, is of the type (2.155): 


A, = —(a|@). (4.28) 


. In order to evaluate Cg we multiply (4.27) by (d |. The calculation 
of the first term requires the assumption of strong d-orbital localiza- 
tion, otherwise a boundary correction term will appear, similar to 
that occurring in Ba in (2.171). This assumption also implies that 
the d functions of neighboring atoms do not overlap, i.e., the d 
bands are only formed by the perturbation potential ôV acting 
through the operator A. The result is 


Ca= —(d}q) +) (4.29) 


It will be seen that (4.29) corresponds to (2.166) in that both have 
the same resonant form and the same envelope functions for the 
perturbation potential matrix elements. This similarity is not re- 
moved by the first term in (4.29). 

As a result, Eq. (4.27) becomes 


(—V24LW+8,—E) p=0, (4.30) 


Ch. 4. Theory of pseudopotential formfactors 147 


where we have introduced the simple-metal pseudopotential w 
and the transition-metal resonance addition %, which are 


W =y — > (Ea — E)la) (a| — È (eu— EB) Id) (d| 


a 


+È (1d) d\A) + 2 Ald) (dl, (4.31) 


N Ald) (d]A 
S= — J Ae . (4.32) 


Note that (4.32) coincides with the second term in (2.169). 

Consequently, we arrive at expressions for the formfactor of a 
simple metal (where A = 0) and for the resonance part (cf. (2.170) 
and (2.172)): 


(k + q|W0PW |k) 
= (k-+q|Vor|k) — È (Za — E) (k+ qla) (alk), (4.33) 


Ald) (d|Alk) 
&g—E 


(k-+ q|B,|/k) = — JEEN (4.34) 
d 


Since (4.34) contains a matrix element between the functions 
|k) and |d), the term $, is often called the hybridization poten- 
tial [808, 89]. It should be remembered when constructing the form- 
factor that &,, like the original ion potential, will act as a scatterer 
of electrons and its screening must therefore be taken into account. 

The pseudopotential (4.33) and (4.34) can be used to calculate the 
total energy of a nontransition metal whose resonance energy gq is 
far outside the filled part of the conduction band (otherwise the per- 
turbation expansion diverges). Treating &, as a hybridization po- 
tential allows us to speak of a hybridization energy. It follows from 
the discussion of scattering in Chapter 2 that the “hybridization” 
should be interpreted as the interaction of a quasi-bound d level 
with the continuum of levels. This is just what this term usually 
means in the theory of transition metals, i.e., the interaction be- 
tween a wide nearly-free-electron band and a narrow atom-like one. 
In terms of the LCAO model, the wide band similar to those in non- 
transition metals is essentially duc to s levels while the narrow band 
arises from the d levels. The interaction between such two bands 
may therefore be somewhat lcosely termed sd hybridization. The 
same follows from the scattering theory approach: at low energies 
the scattering mainly affects (cf. (2.89)) the s component of the in- 
coming wave, that is, the quasi-bound d level is indeed mainly hy- 
bridized with s electrons. 

Apart from Harrison’s paper [308] there are a number of other re- 
lated works, in particular those by Moriarty [811-316, 181, 182] on 


10* 
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heavy alkali and noble metals. All of them, however, also treat the 
d orbitals as a separate group, an approach which automatically 
brings about the resonance form of 8, and so leaves one within the 
Harrison scheme. It is worth mentioning here Ball’s idea of an ini- 
tial background screening of an ion pseudopotential [177] which 
was employed in [181, 182, 316]. 

The energy dependence of the OPW formfactor is the same as that 
of the scattering theory formfactor and does not threaten level merg- 
ing (sce Sec. 4.1.3). Needless to say, the resonance form of the form- 
factor is an obstacle when using perturbation theory, but as d states 
are, by assumption, outside the energy region of interest, no 
convergence problems will arise (the quantity 8, is small because 
the energy denominator is large). 

Strictly speaking, the OPW formfactor (4.33), (4.34) has been set 
up for the noble metals and there is no reason to belicve that had 
we succeeded in constructing a transition-metal formfactor (which 
is questionable), it would also be of resonance form. In the above 
case, however, the resonance in the formfactor is actually brought 
about by quasi-bound states, as is the resonance in the scattering 
theory formfactor (2.170). Since this also applies to the transition 
metals, we would suggest that the transition metal OPW formfactor 
also has a resonance. 

That this is indeed true can be proved in a strikingly simple man- 
ner, without an intricate analysis like that for Eq. (4.34). The proof 
involves the secular equation of the crystal band-structure problem 
(see Chapter 1). 


4.2.2. Secular cquation and hybridization. The OPW pseudo- 
potential construction was based on the expansion (4.24) of the wave 
function ¥ in terms of plane waves and atomic orbitals. Since the 
expansion coefficients for the orbitals were determined from certain 
additional conditions prior to the calculation, the orbitals were 
“underprivileged” with respect to the plane waves and, strictly 
speaking, were not trial functions at all. 

Clearly, in order to improve the mathematics the atomic orbitals 
should be treated on equal footing with the plane waves and their 
expansion coefficients should be determined by the same computa- 
tional procedure. The extension of the class of trial functions always 
improves the accuracy of the method [37, 317]. 

Let us then take the wave function to be 


¥=Z Bylk) +2 Agia) (4.35) 


in which expression there is no sense in isolating the d orbitals be- 
forehand into a special group: they are included in the sum over a. 
To unify our notation, the plane waves are labeled by the vectors k 
themselves, rather than by the reciprocal lattice vectors gẹ. 
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We proceed by following the argument of Introduction, Sec- 
tion 5. Inserting (4.35) into the Schrédinger equation gives 


(H—E)| > Bilk) + 2 Aala) |=0. (4.36) 


By multiplying (4.36) by (k’ |, integrating over r and using the 
notation 


Hay = (a|H |b), Sab = {a|b) ’ 
where a,b={k}, {a}, we get 
>: (H — ES) cg By + È (H — ES)wa Ag=0. (4.37a) 
k a 


If we then multiply (4.36) by (a@’ | and integrate over r we get 
2 (H — ES)wx Be + D (H— ES)a'aAa= 0. (4.37b) 
£ a 


Expression (4.37a) may be written for the Ny functions |k), and 
(4.37b) for the NV, functions | a). The total number of equations is 
equal to the number of unknown coefficients. Equations (4.37a) 
and (4.37b) thus provide an homogeneous system for the coeffi- 
cients {B,} and {Aa}. In symbolic form, the system is 


H—ES),.: H— ES), B 
(_ Mkt ( )k'a ( k\_ 9 (4.38) 
(H--ESYgx% (H—ES)are Ay 
The system has nonzero solutions if 
H —ES),: H — ES)y a 
act ( | Wwe ( Me )=0 (4.39) 
(H— ES) ark (H— ES)a'0 


The upper left-hand block in (4.39) describes a band formed in 
terms of the NFE model, whereas the lower right block, a band in 
terms of the LCAO model. The off-diagonal blocks describe the 
hybridization of the two models (or the hybridization of the bands). 

Thus the band structure resulting from Eq. (4.39) contains bands 
of two types (the wide NFE and narrow LCAO) hybridized with each 
other. In particular, this model is consistent with the generally 
accepted view that the band structure of a transition metal is made 
up of wide s bands hybridized with narrow d bands. 

Next we consider the construction of pseudopotential formfactors 
using the secular equation (4.39). 

In the absence of hybridization we would obtain an ordinary 
equation of the plane wave method, 


det |( — ES)x'x| = det |(€n — E) ônn + (k + gn |W |k +gn)| =0, 
where (k + gn: | W |k + gn? plays the role of a formfactor. 
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The effect of hybridization on the wide NFE band may be esti- 
mated by subjecting (4.39) to Lowdin’s folding procedure (see 
Introduction, Section 8). The result will be 


det \(& — £) Sek + (k’ |V |k) + Fur (E)| = 0, (4.40) 


where 


Fi (E) = — » (H— ES)wa (H — ES)" Jaa (H — ES). 4.41) 


The functions |k) are assumed to be mutually orthogonal as are 
the functions | a): 


Sen = Sek, Sora = Saas 


Assuming the LCAO band to lie well below the conduction band, 
the functions | œ ) may be considered to be core states, i.e., the eigen- 
functions of the crystal Hamiltonian: 


Hag = (a H œ) = E Ôa, (4.42) 
Hva = (KH |) = D (1B) BIE |e) = Ea Ska (4.43) 
and we obtain 


(Eg — E) S vaða’ (E ,— E) S , 
Fix (E)=— > — = MA 
= — >) (Ea — E) (k' |œ) (a|k),j (4.44) 


x 


that is, the folding gives rise, asa result of the inclusion of the 
core orbitals, to a compensatory term in the simple-metal OPW 
formfactor. 

If the LCAO band is close to the conduction band, the | a) func- 
tions are no longer Hamiltonian eigenfunctions, that is, (4.42) is 
nondiagonal and (4.43) should be replaced by 


Hyra = >i Sip ga- 


As a result, the factors in the numerator and denominator of 
(4.41) do not cancel and we obtain [318] the same resonance-type 
energy dependence as that in (4.34): 

(H— ES) gra (HES) og oe 
Fix (E)= — Dy — rp §(4.45) 
a 


The energy dependence of the formfactor thus arises from the in- 
teraction with atom-like orbitals (the atomic orbitals that have sur- 
vived the transition to solid state). 
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The above argument may be reversed. Given a certain energy-de- 
pendent formfactor, we can write in terms of it the matrix elements 
of the secular equation and then “unfold” the equation into an ex- 
pression of the type (4.39), thus coming to the coexistence of wide and 
narrow bands. This explains why taking into account the energy 
dependence of formfactors (sce Sec. 4.1) brings about some features 
of the LCAO model. 

An important by-product of the analysis of Eqs. (4.44) and (4.45) 
is that it becomes clear why the parameters of the pseudopotential 
in the secular equation can be fitted by small-size matrices. The 
energy dependence of the pseudopotential formfactor is such that 
neglecting the matrix elements with large reciprocal vectors is 
equivalent to including them, by (4.44), into the formfactors of 
smaller matrices. In other words, the “neglected” terms are actually 
included in the pseudopotential’s parameters. This means that pseu- 
dopotentials fitted for calculating band structures from small-size 
matrices will almost certainly be inadequate for determining other 
crystal properties whose calculation does not involve convolution. 

It is clear, on the other hand, that treating a pseudopotential’s 
parameters as adjustable, it is possible even with a small matrix 
to obtain a band structure in a narrow energy range (for example, to 
map the Fermi surface). 

The simultaneous expansion of the wave function in terms of 
|k) and | a) was first suggested in [319, 320]in the context of in- 
terpolation schemes. The Hamiltonian matrix elements were treated 
as adjustable parameters. If they are calculated, then the band-cal- 
culation scheme is called the mixed basis method [321-325]. Equa- 
tions (4.35) through (4.45) are borrowed from [348]. 

Strictly speaking, the expansions in (4.21) and (4.35) are illegiti- 
mate since a basis made up of plane waves and atomic orbitals 
is “overcomplete” (a plane wave basis being complete in itself). As 
a result, the trial functions are linearly dependent and some of the 
rows of the secular determinant (4.39) are combinations of some 
others. This, of course, makes the determinant zero. It should be 
emphasized [13, 326] that this conclusion is independent of the 
values of the energy E and wave vector k. 

We demonstrated earlier that the OPW pseudopotential method 
arises from the secular equation (4.39). Now that we see this equa- 
tion is “wrong”, is there any justification at all for the use of the 
pseudopotential method? 

The linear dependence of trial functions only become significant 
for large matrices, say, 1000 x 1000 or more. But even before that 
the poor convergence of the OPW method manifests itself. The solu- 
tion (i.e., the energy level) must approach a certain limit as the 
matrix size increases. In the OPW method, even a relatively small 
matrix provides a good approximation to the correct value (as 
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given, for example, by another band theory method). As the matrices 
grow larger, the solutions start oscillating about this value. The 
OPW method thus seems to be of little help in improving the accu- 
racy of the eigenvalues [326-329]*. 

Even though the OPW determinant fails to provide sufficiently 
accurate solutions, it does not vanish as might have been expected. 
What saves the situation is that we can never get an infinitely large 
determinant, for which the overcompleteness of the basis would be 
important. To put it another way, the actual basis set we deal with 
(be it a plane-wave or OPW basis) is always truncated, with no 
linear dependence between the functions. 

The practical conclusion to be drawn from the above is that if 
carried through from first principles, the OPW pseudopotential 
method cannot be expected to be very accurate. In band-structure 
calculations the accuracy of the method is at most 0.01 Ry [827, 
328] and it is unlikely to be higher for other crystal properties. Need- 
less to say, the use of appropriate experimental data for improving 
the parameters of OPW formfactors will be always profitable. 

We shall see in Sec. 5.1 that the OPW method is not alone in 
having the property of overcompleteness. Even the Green’s func- 
tion method uses an expression of the type of (4.35) and is there- 
fore also overcomplete. 

Attempts have been made to eliminate the overcompleteness 
of the OPW scheme. The way this is usually done is to ignore plane 
waves with large wave vectors (which in practice cannot anyway be 
included in the expansion of VY). This removes the overcompleteness 
of the OPW basis and makes OPWs orthogonalizable [333-336] (the 
orthogonalization of each plane wave to core states prevents the plane 
waves from being mutually orthogonal). We can thus obtain an 
energy-independent OPW pseudopotential [337-339] (and this even 
for transition metals [340, 341]). 


4.2.3. Energy-independent formfactors. In Sec. 4.1.2 we actually 
considered a method for constructing an energy-independent simple- 
metal OPW formfactor using a model dispersion law (see (4.11)). 
The use of the OPW secular method makes it possible to do 
away with this energy approximation. To see this, we transform the 
OPW matrix element to the form - 


(ex — E) Sure +V wk — Dy (Ea — £) (k' |æ) (|) 
a 
= (ex — E) [ôw — >) (k’ |) (alk) | + Vier 
— >) (Ez — £x) (k' |æ) (|). 


* Among other things, the solution is sensitive to the choice of core level 
energies [75, 330-332] which in a real crystal differ from those of the frec-atom 
ones [73, 75, 330] (this can be observed experimentally [76-78]). 
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Introducing a pseudopotential w 
(k' I Ik) = (k' |V |k) — È (Ea — tx) dk’ |) (œk), 
Qa 


and a matrix Q,-, formed bythe overlap of individual OPWs 
Qa = (k"| — È Kla’) (ae 11 Yk) — È |æ) (a|k)] 
= k'k — È: (k’ |a) (a{k),. 
a 


we get the secular equation to read 
det |(ex— E) Qk + (k |Ñ |k) | ==0. (4.46) 


Since the matrices we are dealing with are finite in size, the deter- 
minant of the matrix Q is different from zero. The matrix Q may be 
taken out and since it is energy-independent, the zeros of the deter-- 
minant (4.46) will be given by 


det |(e,-- £) Suet (QH Ck" |W |k)| = 0), 


It follows from this expression that the formfactor can be repre-- 
sented by the matrix element 


(k |W ky = È (0k (k" |W |k). (4.47) 


This formfactor does not depend on energy. Unlike equation (4.12),. 
the transformation leading to (4.47) is exact. 

We will not discuss here the advantages or disadvantages of 
Eq. (4.47). Our purpose has been to demonstrate that the problem 
of an energy-independent OPW formfactor seems to be solvable in 
principle. 

It should be stressed that the fundamental difference between the- 
simple-metal and transition-metal OPW formfactors arises from 
the fact that atomic d orbitals are not eigenstates of the crystal Ham- 
iltonian. The d electrons cannot therefore he considered to be either- 
bound or free. Treating them as a separate group (in scattering lan- 
guage these are quasi-bound electrons) brings about a resonance- 
type energy dependence of the formfactor (see (4.34) and (4.45)). 


4.2.4. Depletion hole. An important feature of any pseudo- 
potential constructing procedure is the “orthogonalization (or de- 
pletion) hole”. 

The effect is essentially simple. Replacing the true wave function. 
by a model necessarily alters the charge in the region where the two- 
functions differ. The electroneutrality condition may thus be vio-- 
Jated, which cannot be tolerated. But then, having constructed a cer-- 
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tain pseudopotential, we are obliged to evaluate the corresponding 
variation in charge Zapı (the depletion hole) and somehow take this 
charge into account. The way this is done is as follows. 

Let the true function ¥ and model function @ coincide outside a 
model region of radius Ry, (which need not be the same as the core 
radius R, but is simply a certain length). Then the total variation 
in charge due to the replacement of ¥ by @ is given by 


Zan= D | UY lol? dr. (4.48) 


k<k pam 
The Schrödinger equations for ¥ and @ are 

(V° + V(r) — EY (r, E) = 0, (4.49) 

(V° +W r, E)— Ejot, E) =0. (4.50) 


We now proceed along the lines used in Sec. 2.2 for deriving the 
Fricdel sum rule. Namely, we write down (4.49) for an energy Æ 
and multiply it by ¥* (£,). Next we write down (4.49) for &,, make 
it complex conjugate and multiply by YF (£,). We then subtract one 
equation from the other and assume that Æ, is close to Æ., which 
allows a Taylor expansion for all functions of Æ. The resulting ex- 
pression for | ¥ (Æ) |? will then contain no potential. The same pro- 
cedure, when applied to (4.50), will lead to an expression for | ọ (E) |2 
involving an energy derivative of the pseudopotential (which has 
also been expanded). 

By substituting the expressions for | Y |? and | @ |* in (4.48), 
using Green’s theorem and taking into account that on the surface 
of the integration sphere the functions ¥ and ọ, by assumption, 
coincide, we find the charge of the depletion hole [342] 


ow 
Zan=— Dd | of 0) Ger Pe (P) ar. (4.51) 


kkp 


lt may be worthwhile before proceeding further to discuss this 
expression. At first glance it seems to be at odds with our earlier 
assumption that the replacement of the true wave function by a mod- 
el wave function necessarily gives rise to a depletion hole. Accord- 
ing to (4.51) this may only be true for an energy-dependent pseudo- 
potential. Had we constructed an energy-independent pseudopotential, 
Eq. (4.51) tells us there would be no such a hole. 

Strange as it may secm, this is indeed the case. Consider the start- 
ing formula (4.48). What it actually gives is the difference between 
two normalization integrals, one for the true function and the other 
for the model function. As (4.48) suggests, a depletion hole arises 
‘when the model function is not normalized. [t follows from (4.51) 
that if the depletion hole is different from zero, the pseudopotential 
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is energy-dependent. The lack of normalization of the model func- 
tion thus brings about an energy dependence of the pseudopotential. 

The derivation of the energy-independent OPW formfactor (4.47) 
was based on the separation of the Q matrix in the OPW normaliza- 
tion integrals and the subsequent “normalization” of the formfactor. 
Since there is a connection between the way the pseudopotential de- 
pends on energy and the lack of normalization in the model func- 
tions, clearly an additional orthogonalization between the OPWs 
and their normalization may lead to an energy-independent pseudo- 
potential, a subject for further study. 

We now proceed with our discussion of (4.51). This expression, is 
reminiscent of the Friedel sum, which appears as an integral over all 
states of the sum of time delays also defined through energy deriva- 
tives, viz. Tı = 0n,/0£. (Note that in scattering theory the states are 
labelled with their energies rather than with the wave vectors as 
in (4.51)). 

The energy dependence of the depletion charge is similar to the 
energy dependence of the model potential radius involved in the 
phase function method. We saw in Sec. 2.4 that the radius of the 
region which can be cut off from the scattering potential depends on 
energy. But then so is the amount of the “cut-off” charge. 

Note that (4.48) involves the wave functions of an electron in a 
crystal rather than trial functions. Accordingly, (4.51) involves a 
“crystal” model function rather than a trial (|k)-type) function. 
However, an order-of-magnitude analysis in [342] indicates that 
(4.51) may be rewritten as 


Zani=— D kg li. (4.52) 
k<k p 
The Zapı for the OPW formfactor is 
Zan = — J (Xew car) X (2 EKA), 
kK<kp a k<kp Pĝ 
(4.53) 


Here the first term is the depletion charge for simple metals and 
the second is the one for transition metals. A new “strange effect” to 
be seen from (4.93) is that in a transition metal the depletion hole 
may prove to be smaller than that in a simple metal. Thus far the 
difficulties we encountered in simple metals were invariably more 
pronounced in transition matals. 

The paradox is easily resolved by noting that the resonance- 
producing quasi-bound state is localized in the vicinity of the core 
(it would be wrong to say “inside” since for d electrons the core bound- 
ary is blurred, see Sec. 3.4.8). Hence a conduction electron whose 
energy coincides with the energy of this state can penetrate deeper 
into the core than an electron with a different energy. The “resonant” 
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electrons thus reduce the charge deficit produced in the core by the 
pseudopotential construction process, in this particular case by ortho- 
gonalization (the depletion charge is frequently called the orthogon- 
alization hole). With model pseudopotentials it turns out that the 
depletion hole is indeed small for the transition metals (within 10% 
of the valency) whereas for the noble metals it is much greater, rang- 
ing from 11% for Cd through 16% for Cu to 32% for Au [349]. 

In both OPW and model pseudopotential calculations the deple- 
tion charge is always taken into account. The way it is usually 
done is by replacing the valency Z, wherever it occurs, by the effec- 
tive valency Z* [73, 17, 344] which is 


Ze=Z + Zapl = Z (1 + ay. (4.54) 


(Certainly, the Fermi energy and wave vector are determined for 
the true valency Z.) 

To conclude this section, we note that the construction procedure 
for the OPW formfactor, Eqs. (4.40) to (4.45), must also apply to 
the LCAO model. We might have considered the effect of a wide s 
band on a narrow d band and obtained the secular equation 


det |Hora— ESwa + Fara (E)| =0, 
where 
Fara = -È (H — ES) a (H — ES) Jy (H — ES)wa. 


We would thus have considered the formation of bands from “atomic- 
levels Haa with the additional perturbation (pseudopotential) com- 
ing from the interaction of a given “atomic” level (through the 
continuum) with other “atomic” levels. 

It follows that the hybridization between wide and narrow bands 
broadens the narrow band, the reason being the increased perturba- 
tion. 

In broad terms, then, the pseudopotential method is not restricted 
to the OPW formalism but can equally well be developed in the 
LCAO scheme. It may be said that both “limiting cases”, the nearly- 
free-electron (NFE) and nearly-localized-electron model (we called 
it the LCAO), may be formally united in one secular equation (4.39). 
The possibility of changing from one model to the other cannot be 
claimed to have been proved here because we incorporated it our- 
selves when expanding the wave function V in terms of the mixed 
basis (4.35). 

It will be shown in Sec. 5.1 that a rigorous scattering-theory treat- 
ment justifies expansion (4.35) although the models the secular equa- 
tion generates in the limit of no interaction will be neither the sim- 
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ple free-electron models (as in the NRE approach) nor the simple 
atomic-electron models (as in the LCAO approach) but modifica- 
tions of them, viz., the empty lattice and individual cell models. 


4.3, Phase-shift formfactors 


4.3.1. Terminology. In Sec. 2.3.3 we introduced the scattering- 
theory pseudopotential by using the formula (2.56) for #7, and phase 
shift 1). Clearly any pseudopotential defined in terms of à, (E) can 
be rewritten in terms of phase shifts using Eq. (2.67). We shall there- 
fore call this a “phase-shift” (PS) pseudopotential. Its formfactor 
is defined in the usual way as the pseudopotential matrix element 
between plane waves |k) and |k + q). 

We shall see in Sec. 5.1 that the formfactors of the scattering- 
theory pseudopotential (2.105) coincide with the matrix elements 
of the secular determinant in the reciprocal lattice representation 
that arises from the Green’s function method. This secular equation 
is known as the secular equation of the Korringa-Kohn-Rostoker 
method in the Ziman representation (KKRZ). We shall now call 
the formfactor of pseudopotential (2.105) the KKRZ formfactor*. 

Another secular equation written in the reciprocal-space represen- 
tation is that of the augmented-plane-wave (APW) method [221]. 
It will not be discussed here in as much detail as the Green function 
method but will be sometimes referred to. Conceptually, it is inter- 
mediate between the OPW and KKRZ methods. Augmented plane 
waves (or the trial functions of the APW method) are set up in the 
same way as are orthogonalized plane waves (the OPW trial func- 
tions). Both are plane waves far from a scatterer. The difference is in 
the way the effect of the scatterer is described. In the OPW method, 
a plane wave is orthogonalized to the scatterer’s core states (E < 0) 
whereas in the APW method it is matched with a function describing 
the scattering, i.e., with the exact solution for Æ > 0. With the 
KKRZ trial functions, things are more complicated and a more de- 
tailed analysis is needed (postponed until Sec. 5.1). 

Since the OPW method employs scattering theory, the matrix 
elements of the OPW secular determinant are expressible in terms 
of phase shifts. Therefore the phase-shift formfactors will be dis- 
cussed with those of the augmented-plane-wave method. 


4.3.2. KK RZ and APW formfactors. To find a KKRZ formfactor we 
evaluate the matrix element between plane waves for pseudopoten- 


* Traditionally, the names of computational methods are chosen to reflect 
the approach used (e.g., the OPW, LCAO, Green’s function methods).The name 
“the KKRZ method” is however generally accepted. 


158 Part 1. Pseudopotential methods. Theoretical principles 


tial (2.105): 
(k-+q | WEEP | k)= k +a] $ (u — J) ô (r— R) Pim |K) 


Im 


=- È (u) | et 4086 (r— R) M (k, |x |) Yz (0) dr 


= J, TE (E) S, (k, k+q), (4.55) 


L 
where a 

TES” (B) = h(E) — 51 (E), (4.56) 

Sy = Rhy (k, R)hE(k-+-q, R). (4.57) 


The function hy, is defined by (2.33); A, is the logarithmic derivative 
for the radial wave function Ẹ, and is defined by: (2.66). 

An APW formfactor is more difficult to obtain so we shall only 
reproduce the final result (for more detail of the APW method see 
[13, 223, 345, 346)): 


k-+q | WY | k) 
= SE (Ek (k+) + +2 THs, (k,k+q), (4.58) 





where S; is defined by (4.57), and 
TP?” (E)= M(E). (4.59) 


The important point about Eqs. (4.55) and (4.58) is that the energy- 
dependent and energy-independent contributions, T} and Sz, are 
separated out to form a product. A function of several variables 
which is transformable into the sum of the products of functions of 
one variable, is said to be separable. For example, Green’s function 
2.9) depends on r and r’ and is a separable function of these varia- 
bles. For a spherically symmetrical scatterer, the wave function is 
also separable with respect to the magnitude and angles of the vec- 
tor r (see (2.26)). 

The APW formfactor has a nonseparable term, the first one in 
(4.58). This term is independent of the potential and the reason it 
appears is that the plane wave exp (iqr) has been integrated over a 
sphere of radius R, the range of the crystal potential, rather than over 
the Wigner-Seitz cell (the integral would then be 5,,,). Note that the 
KKRZ and APW methods employ finite-ranged potentials (MT po- 
tentials), as follows from definitions (4.56) and (4.59) in which there 
is a logarithmic derivative over the sphere. The introduction of 
pseudopotential (2.105) was only possible because we had assumed 
the potential to be of finite range. Had the potential extended beyond 
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this (muffin-tin) sphere, we would have been obliged to divide it. 
into two regions, one inside and the other outside the sphere; the 
inside potential could be replaced by a pseudopotential of the type 
(2.105) and we would obtain a separable term like that in (4.55): 
while the outside potential (corrections to the MT form, cf. Sec. 3.4} 
would give nonseparable terms (which are the Fourier transforms of 
the potential). 

The separability of the KKRZ formfactor is not limited by the 
energy-dependent functions being separable from energy-independ- 
ent ones. Separated also are the potential and structure contribu- 
tions to the formfactor (this is to be distinguished from the matrix 
element factorization we met in Chapter 1). The function S$, only 
depends on the crystal type, the lattice parameter, and the range of 
the potential (which is related to the interatomic separation) and is 
therefore determined uniquely by the crystal structure. The func- 
tion 7, is in fact only determined by the crystal potential’s charac- 
teristics. 

There is a major difference between the two phase-shift formfac-' 
tors. The KKRZ formfactor contains the difference between the 
logarithmic derivatives for the motion with and without scattering, 
whereas the APW formfactor only involves the derivative for motion 
with scattering. This means that in the empty lattice model the 
KKRZ formfactor is automatically zero because 


Aa Ino scattering = Jy (E), T¥ERZ (E) == 0 
The APW empty lattice formfactor is different from zero 
(k+q | waew | k) Ino scattering 
= SE (E —k (k +q)) 22 E)S 
— ( (k + q)) == +2 Jı (E) Sx (k, k +4). 





It follows that the KKRZ formfactor can be expressed in terms of 
the APW formfactor: 


(k+q | WRER? | k) = (k + q | wrrw | Kk) scattering present 
—(k+q l warw l k) Ino scattering: (4.60) 


The APW formfactor can be converted into a form very similar to 
the KKRZ formfactor. Using an equation from [223] 


Anh) 2 jı GR) 
k (k +0) ke 


+42 Di (gp h(k, R)) hr(k+a, R) (4.64) 
L 
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we oblain 


2 q 
+ DFE, k?) Si(k,k+q) (4.62) 
L 


‘where we have generalized (4.56) to 
Ti (E, By) =), (E) — J (E). (4.63) 


In the Fermi-sphere approximation E = Eẹ and k = kẹ so that 
the nonseparable term disappears (incidentally it no longer depends 
‘on the angle between k and k + q in (4.62) as it did in (4.58)), and 
the APW formfactor is identical to the KKRZ formfactor. 

Anticipating a further result, we note that the KKRZ formfactor 
has a singularity both at 2, = 0 and at jı = 0. Although this cir- 
cumstance affords nearly energy-independent formfactor for simple 
metals, the way transition-metal formfactors depend on energy be- 
comes more complicated. 

With the APW formfactor no such difficulties arise for the transi- 
tion metals. In the simple-metal case, the departure of E (k) from 


k? is small and the quantity T, is a weak function of energy for the 
same reasons KK RZ formfactor is. Moreover, since the functions A, 
and 4, in (4.63) behave very much alike, they compensate for each 
other’s energy dependence to some extent. 

The APW formfactor is thus in no way inferior to its KKRZ 
counterpart, especially for the transition metals where, as experience 
shows [346], the singularities in 4%, (£) greatly affect the conver- 
gence of the KKRZ secular equation (the same is true for the heavy 
alkali metals [347)]). 

For this reason, the KK RZ method is used very rarely in band cal- 
culations on metals, and preference is given to the APW and 
KKR methods and a modification of the latter, the model Hamilton- 
jan method. In Sec. 5.2 we shall investigate why scattering theory 
suffers this failure in the consistent pseudopotential construction. 
We shall find that the theory actually provides a certain class of 
pseudopotentials which includes the KKRZ pseudopotential but 
this latter is sometimes not quite satisfactory. It is possible to set 
up a pseudopotential having the same functional form as the KKRZ 
one, but with a “better” energy dependence (see also Sec. 4.4). 

Unfortunately, the KK RZ formfactor is the only one of this form- 
factor class to have been studied. In the following, therefore, we 
shall restrict ourselves to the KKRZ pseudopotential, whose speci- 
fic features will be easily distinguished from those “generic” to the 
class. 
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In the KKRZ formfactor (4.55), the sum over / may, in view of 
(4.56) and (2.89), contain only a few terms (lma, = 3) since at higher 
Z the scattering is small and therefore à; (E) ~ -7,(£). With the 
APW method, the analogous sums (4.58) and (4.62) should contain 
cnough terms to secure convergence, which increases somewhat the 
required computer time. 

On the whole, the APW method has proved to be a workable scheme 
for band calculations for metals and alloys [10, 223, 345, 346, 
348], although, in terms of scattering theory, it is less justifiable 
than the KKRZ method. l 


4.3.3. The formfactor’s q-dependence. Let us investigate how the 
PS formfactor depends on q. In the APW formfactor, we know that 
the nonseparable term decreases (see (4.58) and (2.29)) as 1/g, i.e., 
the APW formfactor as a whole falls off no faster than 1/g whereas 
the OPW formfactor decreases like 1/q?. 

The KKRZ formfactor has no nonseparable term, but it is easy to 
see that its g dependence is determined by the spherical Bessel func- 
tions (entering Sz) whose asymptotic value is such (see (2.29)) 
that the overall behavior is again 1/g. It thus follows that phase- 
shift formfactors decrease more slowly with g than do the OPW form- 
factors or the scattering-theory formfactors (2.161), (4.5), which are 
based on orthogonalized plane waves. 

Let us separate out explicitly the q-dependence of the PS form- 
factor. This was impossible in the OPW case. By applying (2.34) to 
the factor Ar (k + q) in Sz and using, for simplicity, real harmonics 
we obtain 


> TS, (k, k-+-q) = a F, (E, k) ji (qR)%(q), (4-64) 


L 





where 
F,(E,k)=4n >, CErrTyjy (kR) jy (ER) Yr (k) Yre (k). (4.65) 


The Gaunt coefficients can be shown to obey the following sum 
rule: 
, , 6 / (21+ 1) (U4 1) 
L L _ Shik, yf (2b 4) (21 + 1) uo, Z 
2 Cinli = A } rr Fi Cib (4.66) 
which when substituted in (4.65) gives a factorized expression 
F, (E, k)= fi (E, |k |) Yr (k), 


where , 
f(E, [k |) = V 4n (21+ 1) | l 
© xÈ, VIRU + 1) (21" +1) Cio, rojy (ER) jr (KR) Ty (E). (4.67) 
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The PS formfactor is thus reduced to the form 


D 7S1 (k, k+) = ASD AlE, BGR) Y (@) Yi (k), (4.68) 
L 


L 





which is convenient in that the angles of the vectors k and q enter 
on equal footing. 

In Sec. 4.1.2 we introduced the quasi-local Fermi surface approxim- 
ation. The purpose of doing this was to treat a nonlocal formfactor 
as a local one. Our reasoning was that for q œ> 2kp the forward-scatter- 
ing formfactor is of more importance. We may reason, however, in 
a different manner. In the evaluation of the band structure energy in 
(1.40) the summation goes over all vectors q, i.e., all directions of 
the vector q are included. The immediate idea is that we can do with- 
out the Fermi surface approximation by averaging the formfactor 
over these directions instead. This will give us another quasi-local 
expression. Or perhaps we might reason the other way round, name- 
ly: if computational difficulties arise from the dependence of the 
formfactor on the angles of the vector k, why not average over the 
angles? It follows from (4.68) for the PS formfactor that the two 
averaging processes are equivalent and the form of the averaged form- 
factor will be the same for both cases. 

Averaging (4.68) over the angles (of, for example, the vector q) 
gives 


k+l > TS, l= Ze | k+l TSi 1k) dO 
L L 


a 


= FD (E, k) ji GR) Hr (k) Ve | o (a) Y (0) d 
L 


R? . 
== f(E, k) jo(qR). (4.69) 
The zero index on f, greatly simplifies the calculations because 


Ci", ro = \ Y oY 10 79 AQ Ji=0 = Spear. 


41 
Va 
Then 

fo= È (21+1) Ti (E) ji (ER). (4.70} 


We have thus arrived at an exceptionally simple form for the 
formfactor (4.69). 

The model formfactor of Veljković and Slavić [849] has the same 
form as (4.69). The reason for introducing it is that its q dependence 
is the simplest possible. It is amazing that this model formfactor 
turned out to be obtainable within the framework of the scattering 
theory. 
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Let us now find the pseudopotential corresponding to the averaged 
phase-shift formfactor (4.69). It can be readily verified that 


R? . { a DANN 7 
D o (4R) =o | e-ik4 Dry ô (r—R) eikrg3y. 


Therefore the pseudopotential in question is the following local one: 


W (r) = hô (tr —R). (4.71) 


4.3.4. Formfactor’s Æ dependence. The next question to discuss 
is how the PS formfactor depends on energy. We shall start with the 
KKRZ method because its formfactor is better justified from the 
scattering theory point of view. A knowledge of its energy dependence 
may then shed light on the energy dependence of other formfac- 
tors. In particular, we shall learn where singularities may be expect- 
ed when the undoubtedly more convenient energy-independent 
model potentials are applied to transition metals. 

We saw in Sec. 2.4 that scattering theory in the Born approxima- 
tion is equivalent to using perturbation theory in pseudopotential 
calculations. Therefore the study of a formfactor’s energy dependence 
is best started from a discussion of Born scattering. 

The Born approximation is applicable when the scattering phase 
is small. In the zeroth approximation, the phase shift is found by 
replacing the wave function with a plane wave; in the first approx- 
imation a singular solution must be “admixed” through a zeroth- 
order phase shift: 


RE = jı (ur) — tan n?n (ur). (4.72) 
The expression for the logarithmic derivative A} is readily obtained 
by expressing A, in terms of tan ny, by (2.67) and then dividing 
the numerator by the denominator, i.e., 
Bop) wit tan on m 2 
àr (E) œ~ ZRI? [1+ 7 tan y+ (+ tan n) + ae (4.73) 


il 





The ratio j,/n; in (4.73) is, by (2.73), the tangent of the scattering 
phase for a perfectly hard sphere, tan B;. It can be seen that (4.73) 
converges if 


| tan y, | < | tan B; | (4.74) 


which, by the argument that followed equation (2.107), is the con- 
dition for weak scattering (the scattering by potential is much weak- 
er than that by a perfectly hard sphere). 

It may be said that expression (4.74) is a new convergence criter- 
ion, with respect to the Bargmann one (Sec. 2.4), for the pertur- 
bation expansion. It was actually (4.74) that we used when deriving 
(2.153) from (2.56). 


11* 
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In the first approximation, then, the depth of the scattering theory 
“pseudopotential” (or, what for this particular case is the same, the 
KKRZ pseudopotential) is 


B = tan n: 
Ai — J) = — “RP? xR) , (4.75) 


that is, the Born approximation is equivalent to the replacement of 
the phase n; in (2.106) by 7). 

We now turn to the energy dependence of a Born approximation 
pseudopotential. The energy behavior of the phase shifts (see Sec. 2.4) 
indicates that small phases are equivalent to low energies and (2.153) 
may be written in the form 


tan n? x a,x?!4+1, (4.76) 


This formula may also be obtained directly from the definition 
of the Born approximation phase, (2.153), using the asymptotic be- 
havior of the Bessel functions given by (2.28) and valid over an ex- 
tended energy range. 

Substituting (4.75) and (4.76) in (4.73) yields 


21-1 
T? =A 4, = — ((22 + 1)? aI [4- Rett K 
Ati y 
+ (faa) e 6 


It can be seen that a pseudopotential in the Born approximation does 
not depend on energy. This is extremely important for in fact it 
provides the basis for the theory of energy-independent model pseu- 
dopotentials (at least for simple metals, when scattering is small). 

A departure from the Born approximation can be represented in 
the form of a power series in Æ: 


tan y; (E) = ax?! + byx2tt+38 + | .., (4.78) 
TP (E)=-4,+B E+... . (4.79) 


We saw in Sec. 2.2 that the transition metals are characterized 
by having bound states and this results in tan n; going to infinity. 
The singularity in tan ny; (Æ) is related to that in A, (EZ) (by (2.83)). 
The Born approximation thus fails for the transition metals because 
the KK RZ pseudopotential depends on energy resonantly. Formally, 
in the transition-metal case we should replace (4.79) by 





a (E)= 04 BE + r , 


Dı 
Fi-- ey ` 


(4.80) 
T, (E)= A, + B,E + 
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Figure 4.7 shows A; and .¥; as functions of Æ for Mg [350a] 
and Zr [350b], and in Fig. 4.8 the logarithmic derivatives 
T,(£) =i, — J, are plotted for a number of transition metals [346]. 





0.2 0.4 E,Ry 0.6 1,0 1.4 E,Ry 


Fig. 4.7. Bold lines are logarithmic derivatives for typical simple (Mg) and tran- 
sition (Zr) metals. Data are from [350]. Dashed lines are the free-electron loga- 
rithmic derivatives, Jı. Note the difference in scale along the ordinate axes. 


It can be seen that for Mg the Born condition T, (E) ~ const is 
satisfied rather well, while for the other metals the energy depend- 
ence of the pseudopotential is appreciable. This is exactly what should 
be expected because Mg is one of the simple metals for which* the 
usefulness of the energy-independent pseudopotentials is beyond any 
doubt [17, 159]. The logarithmic derivatives shown in Figs. 4.7 and 
4.8 were computed in [346, 350] using crystal potentials whose con- 
struction was described in Sec. 3.4. 


4.3.5. KKRZ and OPW formfactors. We arc now able to compare 
the KKRZ and OPW formfactors. 

We note first that the OPW formfactor has a q-dependent separable 
term similar to that in the KKRZ formfactor. The former, however, 
falls off more rapidly with q. The reason for this is that in the OPW 
formfactor the direct-space integration of spherical Bessel functions 
(which are plane wave components) brings about an additional 
factor of 1/q; the appearance of this factor is illustrated by the inte- 
gration of the exp (iqr) term in the Fourier transform of the square 
well of radius R: 

f etard3r — 4y \ sing? 12 gr ——» 4nR LEIR 
qr qo 


q> 





* Similar curves are obtained in [350a] for Be. Our computer calculations 
for the alkali metals give the same results. The situation is more complicated 
for the transition metals. 
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In the KKRZ formfactor, this integration is not actually carried 
out (because of the 6 function), and this is why it decreases more slow- 
ly than the OPW formfactor. The slow decrease of the KK RZ form- 
factor with qg is thus not fortuitous, rather it follows from the funda- 
mental properties of the scattering theory pseudopotential (2.105). 





Dp n 








Fig. 4.8. Difference logarithmic derivatives A, = R? (A; — fı) in transition 
metals. E issmeasured in Ry, A, in arbitrary units. 


It should be recalled, before comparing the energy dependences, 
that in Sec. 4.1. we discussed the relation between the E and q de- 
pendences of the formfactor and noted that a correctly introduced E 
dependence may affect the q dependence. Since the PS and OPW 
formfactors have different q dependences, it follows that the E de- 
pendences may not be transferable from one type of formfactor to 
another. It is therefore all the more interesting that the basic fea- 
tures of the energy dependence (which are in fact responsible for the 
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form of the dispersion law, see Sec. 4.1) are the same for both type 
of formfactors. 

For a transition metal, the OPW formfactor is strongly energy- 
dependent as are the scattering theory formfactor (2.170), KKRZ 
(4.55), and APW (4.58) formfactors. For the OPW formfactor (which 
is the sum of (4.31) and (4.32)) we have in the vicinity of a resonance: 


d d 
age Eti IWY Ik) ~ ap Eta | B, | k) 
=— oo <9, (4.81) 


7 (ea — E)? 
and similarly for (2.170). 
For the KKRZ formfactor, in view of (4.80), 


-E kpa Wk) <0, (4.82) 


The energy dependence of a formfactor in the vicinity of a resonance 
is thus the same for all the models. This means that some features of 
the pseudopotential formfactor are independent of the ccmputation- 
al method we adopt (cf. Sec. 2.5). 

There is a subtlety in the formfactor energy dependence of a sim- 
ple (nontransition) metal. We can argue [73] that all Ea in such a 
metal lie deep down and that variations in Æ on the right-hand side 
of (2.172) and (4.31) are small as compared with Æa. This means, 
by (2.60), that a simple metal’s OPW formfactor is weakly energy- 
dependent which is consistent with the Born approximation form 
(4.77) of the KKRZ formfactor. When scattering is small, the latter 
formfactor is virtually energy-independent. If the energy dependence 
is weak but noticeable, we obtain the following from (2.172) and 


(4.34): 
kta WY | ky > 0. (4.83) 


Using the general result that b, in (4.78) is negative (this can easily 
be seen by considering the next-order terms in the asymptotic form 
of Bessel functions) we see from Eqs. (4.75) and (4.78) for the KK RZ 
formfactor that B, in (4.79) is positive: 


-E (k-+q | WE" | ky 0, (4.84) 


which is consistent with (4.83) but disagrees with (4.82). What is 
the physical (or rather mathematical) meaning of this disagree- 
ment? There are two kinds of singularities in the KK RZ formfactor, 

namely, those associated with the zeroes of #, at E = g, and those 
occurring at E = Ef, and related to the zeroes of jı (RY EJ. The posi- 
tion of Et depends on the quantity R which is related to the lattice 
parameter. When the lattice parameter is large, so is R, and since 
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RY E{ = const, Ef, is shifted downward in energy. Since T, in 
(4.55) has a minus in front of J, it follows that the energy dependences 
of à, and 3, compete with each other and the sign of dT ,/dE de- 
pends on “which singularity is nearer”. Forcrystals with large inter- 
atomic scparations, the dominant singularity is the one in.¥%,, which 
leads to (4.84). When the lattice parameter gets smaller, the energy 
dependences of Ay») and .7,=»9) compensate for each other, the J; 
singularity shifts toward higher energies and the resonance in Aj») 
predominates. In the alkali and alkali-earth metals the lattice para- 
meter is large, the formfactors are virtually independent of E and 
conventional pseudopotential theory is quite applicable. In the transi- 
tion metals, the lattice parameter is smaller and a resonance pseudo- 
potential is required. 

It follows from the foregoing that a pseudopotential may be thought 
of as consisting of two contributions (in the KKRZ method, 
À; and Jı) one cf whichis determined by the interatomic separation 
and the other by the electronic structure of the initial atom. For 
simple metals, the dominant role is played by the “lattice” pseudo- 
potential, while individual features of the specific substance only 
enter as small perturbations. This means that a new pseudopotential 
theory is conceivable in which these parameters (which are respon- 
sible for chemical bonds in a solid) would be explicitly used. One 
aspect of this approach is the possible optimization of the pseudopo- 
tential by considering environmental effects (see discussion in 
Secs. 3.3 and 3.4). 


4.3.6. Andersen’s radii. The important question to be discussed 
is the dependence of the formfactor on the range of the crystal po- 
tential, R. 

It follows from Eq. (2.104) for the scattering theory pseudopoten- 
tial (2.105) that for each / we choose, it has its own value of R. The 
question is: what criteria should be applied? 

Analysis of the R dependence of the PS formfactor requires a knowl- 
edge of the dependence of T, on R. Let us define the function A, (£) 
for all r by defining A, (E, r) to be 


1 





M(E, 1) = po EA (Es r). 
The Schrodinger equation (2.25) then will read 
dh 4 (L4-4 
h =- (n44) — E +V (ry HEL. (4.85) 


à; is now a function of two variables, F and r. We examine the effect 
of the E dependence on the r dependence. 

It can be easily seen that the derivation of (2.71) in Sec. 2.2 could 
in fact be carried out for a sphere of any radius R. It then follows 
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from (2.71) that for any r 
d , 


If r is sufficiently large and the centripetal potential term is smaller 
than the other terms on the right-hand side of (4.85) we get 


<4, (E, r) <0. (4.87) 


The r dependence of A, is thus similar to its E dependence and the 
function À; (r) has singularities, whose positions we shall denote by 
R'. It can be readily seen that R} depends on energy, i.e., the radial 
and energy dependences of the function A, (E, r) and thus of the PS. 
formfactor are related to each other. 

By (4.86), for any r a small increment in energy gives rise to a 
small decrease in the value of A; (£). Had it happened that R} (E + 
AE) > R! (E), it may be shown graphically that (4.86) would. 
inevitably have been violated. It follows from this that R(E + AE)< 
R! (E). Since instead of the R} (E) dependence we can consider 
the e, (r) dependence, any value of r will have its own singularity 
energy €,. Let us take an r and two energies E< and E, such that 


E< <€ (r) < E>. 


Since for E< the function A; is negative and for Ey positive (both 
values are close to the singularity but lie on different sides of it): 
we come to a formal contradiction with (4.86): 


My (E>) > M (E<), (4.88) 


which is immediately resolved, however, by noting that the limit. 
may not actually be taken: as E< —> E, the function À; is discontin- 
uous. 
By (4.85), 
dh 
E=Ey dr 








dài 
| dr 








E=, 


and the two functions, 4,(Z,) and A, (E<) should intersect at some 
point R4. This gives rise to the important equation 


i (Es, Ra)=A, (Be, R4). (4.89) 


Like (4.88), expression (4.89) does not contradict (4.86) simply 
because it is not differentiable with respect to E. 

The positions of the points R4 are related to the positions of Rj. 

It is clear that the values of R4 will be different for each J. Each 
pseudopotential component can thus be “optimized” in the sense of 
energy independence but, of course, only over a limited energy range 
and only approximately. This may be seen from the fact that the 
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functions A, (r) only intersect at energies lying on different sides of a 
singularity in à, Although the quantity R4 will depend on energy, 
the dependence will be weaker than that in the function 4, (E). 

The existence of the MT radii R4 was first noticed by Andersen 
[351] when he was doing a direct recalculation of empirically deter- 
mined energy dependence of the A, into their dependence on the MT 
radius. Some idea as to why and when these points of “energy inde- 
pendence” occur is given by the argument leading to (4.89). 

Although we were actually concerned with 7{PW, the argument 
may be equally well applied to 7KKR2, 

Because of the way they were introduced, the Andersen radii R4 
are related to the dimension Q ẹ of the region in which the model and 
true wave functions lead to equal normalization integrals in expres- 
sion (4. 48) for the depletion hole charge. It can be seen that if the 
radius R is such that the ¿th component of a phase-shift pseudo- 
potential is energy-independent, then the depletion charge corre- 
sponding to it by (4.52) must be close to zero, that is, R4 is the radius 
of the region Qy- 

Rather unexpectedly, we thus observe once again that physically 
equivalent concepts occur under different names in different theo- 
retical models. 

To end our discussion of the r dependence of 7,, we note that the 
singularities of À, (r) will be seen as inflexion points on a plot of the 
yn. (r) function. It turns out then that the r’s for which tan n; (r) 
goes to infinity alternate with the r’s for which i, (r) = 0. These 
two families of points cannot coincide which means that the ener- 
gies Æ, and g; also cannot coincide, a result obtained from (2.83) 
on entirely different grounds. 

Optimization of logarithmic derivatives to “energy independence” 
by using (4.89) and choosing R} will affect, first of all, the relation- 
ship between ant Sy, because they are multiplied by T,. This 
will modify the q dependence of the PS formfactor and of each of 
the functions S, (because changing R affects the argument (| k + 
q | R) of the spherical Bessel functions). We ascertain once again 
that the energy dependence of the formfactor is related to its q de- 
pendence. 


4.3.7. The formfactor’s long-wavelength limit. The optimization 
of the PS pseudopotential is closely related to the long wavelength 
limit of its formfactor. 

Consider the KK RZ formfactor (4.55). By letting q tend to zero 
we find that its long wavelength limit is f, (4.70). Making use of 
(4.55) and (2.106) and assuming the phase shifts to be sufficiently 
:small we obtain 


TERA tanni (E) nı (E) 


RR (xR) ~ xR? (HR) * (4.90) 
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In the Fermi sphere approximation we set k = kp and replace the 
energy in the denominator of (4.90) by the free-electron Fermi energy. 
This gives [352] 


-tr Reh? (l Ev 
Ck | WEEE | py aw y Ee) me) 
z V ERR} erR) 


Using Eqs. (2.33) and (3.91) gives [352]: 
(k | KERZ | k) ~ 4n 1 S (21 4A) (ES) 
l 
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T 3 EF Z 


(4.91) 

We have thus succeeded, with some reservations, in reducing the 
PS formfactor to the general form of pseudopotential theory, i.e. 
Eq. (3.92). It follows that the formfactor can be optimized both to 
“linear screening” (by equating the long wavelength limit to 
— (2/3) EF) and to “minimum perturbation” (by equating it to ze- 
ro). It is clear, on the other hand, that the Friedel sum in (4.91) is 
due to approximations and that in strict terms there is no relation- 
ship between the sum and the long wavelength limit of the PS pseu- 
dopotential formfactor. 

How then should a PS formfactor be numerically optimized and 
what should the optimization criterion be? 

The simplest way to change the magnitude of a potential is by 
moving it uniformly along the energy axis (essentially, varying the 
MT jump or MT shift, see Sec. 3.4). A check can be made either by 
a direct calculation of the Friedel sum from the phase shifts (taking 
the Fermi energy in the first order perturbation theory approxima- 
tion, (3.102)) or by the magnitude of the long wavelength limit of 
the formfactor. 

No first-principles calculations of phase-shift formfactors, not to 
mention their optimization, have been published. The matrix ele- 
ments (k + g, | WEKKRZ |k + g,) and (k + gn: | WAPW |k + 
g,,) are, of course, relevant in every band-structure calculation, 
but are never represented as formfactors. Therefore we will not dis- 
cuss this question now. 

If the local PS formfactor (4.69) is considered to be a model one 
with parameters R and A = R*f,/Q, then expression (4.95) serves 
as a definition for the parameter A: 

2 m F (Ep) 


A= — + Ep 7 


- (4.92) 


We now wish to verify whether or not the theory of the long wave- 
length limit is internally consistent. We shall evaluate the Friedel 
sum ‚Fp for the local PS pseudopotential (4.71) to see whether it 
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coincides with the one in (4.92). This Friedel sum we shall denote 


F (Er). Of necessity, we shall use the Born approximation and 
replace the phase by its tangent. Using (2.38) we obtain 


Fa (EF) === >) (20+ 1) n x 2 >) (21+ 1) tan n? 
l 1 
=e > (21+ 1) ) } (r V EF) fô (r — R) r? dr 


— -e e eriy ii(r VER) = —F QA, (4.93) 


Making use of (4.92) we then obtain the required relationship: 
Fy (BF) = — ŽE QA = F (EF). 


Thus, whatever the model radius or specific optimization proce- 
dure, the local PS pseudopotential (4.71) has a correct Friedel sum. The 
only difference from more rigorous nonlocal pseudopotentials is 
that (4.71) cannot be optimized to a minimum perturbation because 
its formfactor would then be identically zero for all q. 

Something should be said about the long wavelength limit of the 
APW formfactor. Using the same approximations that led to (4.90) 
we can write 

n tan nj nı (EŞ 
Jt HREP ~I (E f)— R?j? (kpR) VE’ 


Substituting (4.94) in (4.58), setting k = kF and using (4.61) in the 
form 


D; (2141) (seq Ji (eR) ji (kR) P, (cos Ok) = (k}— kik), (4.95) 
l 


(4.94) 


we find 
(k WAY | ky = ART Jo, (Eb Er) 





4 ba e3 (2141) dit rn) jı(kpR)+ (k | WwKRZ | ky 


Q KKRZ 
= xt (Eh— Ep) +k | WE" |k) 


oP) 
— SET (Eg — Ex) Z Bb 





(4.96) 


Since, by (3.109), F (Ep) œ (E> — Ep), we thus see from (4.91) 
and (4.96) that both PS formfactors are linear in the Fermi ener- 
8y Er. 
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This has been verified experimentally by employing the APW and 
KKRZ secular equations to reproduce observed Fermi surface cross 
sections using Ep and the phase shifts as adjustable parameters. For 
each (assumed) Æp value it proved possible [294, 353—360, 291, 292] 
to reconstruct the Fermi surface by an appropriate choice of phase 
shifts. This work yielded experimentally determined functions 
mı (Er) (not to be confused with y: (Æ) since at any given value 
E # Ep the surface of constant energy differs in shape from the 
Fermi surface). Long wavelength limit formfactors were calculated 
from these functions and they were indeed quasi-linear in Æp [292]. 
We shall discuss this in more detail at the end of the next section. 


4.4. Effective medium 
and pseudopotential formfactors 


4.4.1, Effective medium as an energy zero. We saw in Sec. 3.5 
that the perturbation a potential introduces into a crystal is deter- 
mined by the properties of the effective medium that models the 
environment of the potential. Depending on these properties the 
potential may disturb the electron gas strongly or weakly and accord- 
ingly we will be dealing with a strong or weak pseudopotential. 

In other words, the optimization of a potential is related to that 
of the effective medium. For example, raising the MT potential 
(inside the MT sphere) by V, is equivalent to lowering the effective 
medium around it (outside the sphere) by Vo. The relationship 
between the effective medium and the way the pseudopotential is 
optimized has not so far been investigated. The effective medium 
concept is used extensively in the theory of alloys (for modeling a 
metal-solvent [361-366]) and it should be noted that even a pure 
(monatomic) metal is an “alloy” in the sense that an array of perturb- 
ing potentials (impurity atoms as it were) has been introduced as 
it were into a uniform electron gas (metal-solvent). 

We see that a variation in the effective medium “displaces” the 
average potential around an MT sphere. This means that we can 
shift the energy zero outside the MT sphere while leaving it unchanged 
inside. Needless to say, some precautions must be taken to retain a 
correct description of the scattering processes. 

Let the wave function outside the MT potential (free electrons, as 
it were) be characterized by an energy F while the wave function in- 
side (a scattered state) by an energy Æ. Then the difference between 
these two energies determines the properties of the effective medium. 

The description of a scattering process requires that the “outer” 
function smoothly join to the inner function at the edge of the po- 
tential. Clearly, such a composite function will depend on two ener- 
gies. The scattering phase. shift will also depend on two energies, F 
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and Æ. We thus face the problem of developing a scattering theory 
“outside a surface of constant energy”, with the initial and final 
states having different energies. 

Note that the matching procedure for wave functions with differ- 
ent energies is not entirely new. In the augmented-plane-wave 
(APW) method the trial function “inside” the potential is a super- 
position of the radial solutions associated with energy £, while 
“outside” the potential it is a plane wave of wave vector k, with no 
conditions on the relationship between F and k. (Note, though, that 
in this case the functions are only matched with respect to the amp- 
litude.) This procedure is known not to prevent the APW method 
from being one of the most successful band structure schemes [13, 
345, 223]. In a sense, an explanation for this success is the exceptional 
flexibility of the APW trial functions. This enables independent var- 
iations of the energy Æ and wave vector k. 

It should be expected that the scattering theory “outside the sur- 
face of constant energy” will provide more versatile pseudopotentials 
than are those currently in use. 


4.4.2. Phase shifts “outside the surface of constant energy’. We 
introduce a wave function ®, (F, E) such that an ordinary solution 
of the radial equation on the constant-energy surface R, (E) will 
be given by 


R (E)= |o, (F, E)ô(F — E) dE =Q, (E, E). (4.97) 


We demand that @, should solve an integral equation of the 
type (2.4): 
D, (F, E)= ji (fr) + | Gi (F, r, ri) V (r) D(F, E, ri)rèdr, (4.98) 


where f = V F and the Green function is defined for “free” electrons, 
i.e., for those on the MT plateau (the effective medium): 


Gi (F, r, r) = fji (r<) n (fr>). 


This choice of Green’s function makes (4.98) satisfy the Schrödinger 
equation for the free motion of energy F (for all r). Equation (4.98) 
determines how ®, depends on F while its & dependence may thus 
far be as we please. 

Requiring that the function ®, inside the muffin sphere satisfy 
the Schrödinger equation for the energy E: 


(—v+y (r) ++ E) D, (F, E)=0, (4.99) 


we find for r< R 
D, (F, E) = A, (E). (4.100) 


For r œ R the function ®, is determined by (4.93). 
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By using Eqs. (4.98) and (4.100) we find the scattering phase out- 
side the constant energy surface (cf. (2.56)), i.e. 


R 
tan n, (F, E) = — f \ Afr) V (r) Ri (E, r) r? dr, (4.104) 
0 


which reduces the function ®,; for r > R to a form similar to (2.57): 
D, (F, E)=j, (fr) —n, (fr) tan n (F, E). (4.102) 


Equation (4.101) is a matrix element for the interaction potential 
V taken between the eigenstate corresponding to V, #, (E), and the 
free-electron solution jf; (ry F). Since n; (F, E) Æ (E, F), the 
order of the arguments is important. In the following the first argu- 
ment will always be taken to be the energy of the “incoming” wave 


(i.e., that occurring in j; (rV F)), the second will be taken to be the 
energy of the scattered wave (involved in 2, (£)). 

Since at r < R the function &, coincides with the Ð, which is 
smoothly defined for all r, we require that the M, defined outside the 
MT sphere should smoothly join to the solution defined inside. This 
means that ®, should be continuous at r = R in both amplitude and 
derivative. For the logarithmic derivative A; (E) defined by (2.66) 
we obtain 
7, (fr) — 3, (fr) Mt 


(E) 
tan n (Py E) = nME yen 


(4.103) 


and this is a generalization of the usual expression (2.67) for scatter- 
ing “on a constant energy surface”. Equation (2.67) can be retrieved 
by setting F = E. 


4.4.3. The KKRZ formfactor and the effective medium. To see 
how the choice of effective medium influences the KKRZ formfactor, 
consider scattering from the potential of the Wigner-Seitz cell (3.73) 
(see Fig. 3.6). There are two natural zeroes for this potential from 
which to measure the energy, namely, the MT plateau (the energy 
Emr) and the vacuum zero (the energy Æ,). There are three character- 
istic regions in the problem, namely, 0 < r< Ryr (region J), 
Rur <r < R, (region II), and Ra < r< œ (region TIT). 

It may be argued that if in region JII an electron has an energy 
E,, its energy in region JI is 


Eur = E, + Vo. (4.104) 


In region J there is no potential plateau and the electron may have 
either of the energies, E, or Emr- 
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To see how the scattering theory formulae are modified when Eyr 
is replaced by E,, we introduce an MT potential 


Vur (7) =V (r) —V, (4.105) 
into region Jf. 
The Schrodinger equation in this region takes the form 


(— Vi + Vmr (r) + —=— UFD Eyr) Ri (Emr)=0. (4.106) 


Shifting the energy zero, substituting (4.104) in (4.106) and using 
(4.105) we find 


(=v V eE E) AEM (Ewr)=0 (4.107) 


Ws 


which is simultaneously an equation for %;'~, so that 


RYS (Ey) = RY" (Emr). (4.108) 


‘Consider now region JII. Obviously, the solution is 


2 (E,) = 4 lji (r V Ey) —m (r VE,) tan ny" (E,)]. (4.109) 


Difficulties come from region JI because here the solution has to be 
matched with the solutions for regions J and JJI. The task is sim- 
plified by using the atomic sphere (AS) approximation (see Sec. 3.4.4) 
which (i) replaces the polyhedral Wigner-Seitz cell by an equiva- 
dent (Wigner-Seitz) sphere, and (ii) extends the MT sphere to the 
boundary of the Wigner-Seitz sphere (3.86). 

In this approximation region JI disappears and the solutions of 
regions J and J smoothly match at Ra. For the scattering from the 
Wigner-Seitz sphere we have (dropping the index on Ra) 


j (R V Eo)— ji (RB V By) AWS (Ep) 


a) lR VE) n RVR CO 
This, in view of (4.108), can be rewritten as 
it (R V Ep)— jı (R V Ep) AMT (E 
tan nS (E,) = ji (RV Ep)— ji (R V Ey) MTT (Emr) (4.111) 


n (R V Ey)—n (R V Ey) AMT (Eyr) 


which, in fact, describes the MT-potential outside the constant ener- 
gy surface: 


tan n5 (Z,) = tan nMT(Z,, Emr). ; (4.112) 


The AS approximation thus automatically leads to a scattering 
theory valid outside the constant energy surfaces of MT potentials. 

Having developed the mathematical formalism, we are now ready 
to investigate its physical meaning. 
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The definition of the MT potential does not involve V, explicitly. 
It follows from Sec. 3.4 that the purpose of introducing this quantity 
is to “hold” electrons within the Wigner-Seitz cell and it has there- 
fore the meaning of the average kinetic energy of the electrons con- 
tained in the cell. This means that V, is the average effective-me- 
dium potential involved in the pseudopotential optimization pro- 
cedure in Sec. 3.5. 

The Lloyd pseudopotential may be easily generalized to include 
scattering outside the constant energy surface or, what is the same, 
to include the influence of the effective medium on the properties of 
the scatterer. By the same argument used in (2.103)-(2.105), we find 
from (4.101) an analog of (4.63) 


TEP? (Eyr, By) = (7 (Emr) — Ji (Eo))r = Ti (Enr, By). (4.413) 
The formfactor of this pseudopotential is evaluated as before 


(k+ q| WEER4 (Eur, Ey) |= X T, (Eur, Ep) S (k, k+q, Ra) (4.114) 


— 


L 


giving the generalized KKRZ formfactor mentioned in Sec. 4.3. 

We next consider how this generalized KKRZ pseudopotential de- 
pends on the properties of the effective medium. 

The simplest approximation possible would be to give up opti- 
mization, neglect totally the effective medium, and when construct- 
ing the MT potential ignore the effect of all the other MT poten- 
tials in the crystal lattice on the given one. It may be said that in 
this approach no attention is paid to the electroneutrality of the 
Wigner-Seitz cell. 

We thus assume that the potential is embedded in the electron 
gas. The result we are supposed to obtain is .+ = Z, that is, neg- 
lecting the effective medium is actually equivalent, in terms of 
Sec. 3.9, to optimizing the potential or in other words we must ob- 
tain the linear screening potential (LSP). Since, by assumption, 
V, = 0, Eq. (4.104) implies that E, = Emr and we are led to a 
conventional KKRZ formfactor. 

A more elegant approach is to choose Vy as the average value of 
the electron kinetic energy. The Jı involved in the KKRZ form- 
factor then depends on energy much less strongly. 

Certainly, the most bizarre (although, as will be shown in Sec. 5.2, 
a strict) approach is to use an energy-dependent effective medium: 


V, = Eyr- (4.115) 


It then follows from (4.104) and (4.115) that thefenergy of the ”in- 
cident” electrons is zero, Æ, = Q. In this case the energy dependence 
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of the KKRZ formfactor is determined by the function A; alone: 


TE (E, 0) = (47 (E)—F), (4.116) 


lt will be noted that like any other pseudopotential, the one in 
(4.113) affects the gas of the free electrons it scatters. The energy of 
a scattered electron is equal, in the AS model, to the energy of re- 
gion JJI (see Fig. 3.6), which is E,. Consequently, the Schrödinger 
equation for the pseudopotential (4.113) takes the form 


(—V?+ WKERZ (Eur, E,) — E,) Y (Eur) =9, 
and the perturbation expansion (1.30) will be 
E, = k? + (k | WS*P4 (Eyr, E,) | k) 


| (k-~-q | WERRZ (Emr. Ey) | k) 1? 
q0 


The dispersion laws are nevertheless determined by the dependence 
Eyr (k), that is, (4.117) must be solved for Eyr rather than £,. 
This is proved rigorously in Sec. 5.2. 

Note that in the AS approach the unseparable term in the APW 
formfactor (4.58) or (4.62) must vanish. This may be seen by writing 


4™ pə j1(¢R 1 “ae 1 e . 

QR RZ aeia =a eiT dèr =5 \ edt 3) — Sq, 0. 
over Wigner- over Wigner- 
Seitz sphere Seitz cell 


To get the third expression from the second we took into account 
that in the AS model the Wigner-Seitz sphere coincides with the 
Wigner-Seitz cell, and to get the fourth from the third we took into 
account that plane waves in an infinite crystal are normalized to the 
Wigner-Seitz cell. Thus the only term to survive in the APW form- 
factor is the separable one corresponding to the scattering “outside 
the surface of constant energy”. 

Let us return to the KKRZ formfactor (4.114). By choosing the 
parameter V, in the form 


Vo = Eur — k? (4.118) 


we obtain from (4.104) 
E, =F (4.119) 


and, as can be seen from (4.113), (4.114) and (4.62), this choice 
of the effective medium changes the KKRZ formfactor into the APW 
formfactor. 

Different V, yield different Friedel sums, that is, different opti- 
mizations of the KKRZ formfactors. 
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4.4.4. Experimental data. Suppose we are given the reverse problem, 
namely, that of fitting pseudopotential parameters to experimental 


data. ln this case, we specify 


the value of EG (rather than Vo) 
in order to reproduce the Fermi 
surface shape. Hence we specify 
the characteristics of the effective 
medium and impose consciously 
or unconsciously some conditions 
on the pseudopotential. The re- 
sult is (aS was mentioned in 


Sec. 4.3.7) that for any given EF 
the PS formfactor has a special 
set of parameters [291-294, 353- 
360]. These may be either phase 
shifts ņ; or logarithmic deriva- 
tives 4. The dependence n: (EF) 
is illustrated in Fig. 4.9 where 
the Friedel sum is plotted as 


a function of Ey. lt can be seen 
that theory and experiment are 
in reasonable agreement. 

The Friedel sum ¥ is related 
to the long wavelength formfactor 
limit as approximated by (4.91). 
Comparing (38.101) and (4.91) 


F 
2.0 


0.0 


-0.5 N 


3 E? E? E% 


Fig. 4.9. Friedel sum plotted as a func- 

tion of the assumed Fermi energy 

of the crystal. Bold line for an APW 

fit to experimental Fermi-surface 

data for Cu[357]. Thin line for first- 

order perturbation theory calcula- 
tion, Eq. (3.101). 


shows that increasing Ef increases the numerical value of the long 
wavelength limit of the formfactor. It follows that determining the 


W (q), Ry 


E? = 0.36 Ry 


W'(q),Ry E? = 0.51 Ry 





Fig. 4.10. APW formfactors from fitting the experimental Fermi-surface data 
on (a) Li and (b) Cu [294, 353, 354]. 


dependence of (k | WKKR4 |k)on Ef and comparing the result with 
our prediction gives some insight into how adequate our approxima- 
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tions are. Figure 4.10 shows the APW formfactors for Li and Cu that 
were obtained in [234, 353, 354] by fitting experimental data. The 
curves are shown for different values of the assumed Fermi energy 


% and thereby for different sets of parameters. Large values of EF 
correspond to small Friedel sums, i.e., to asmall number of screening 
electrons relative to the “given” pseudopotential depth. It can be 


seen that as EF increases the PS formfactor is bound sooner or later 
to acquire a downward bend, otherwise the curve would simply miss 
the experimental points. At g = 0 the PS formfactor depends on 


EF much more strongly than elsewhere. In PS formfactors, increasing 


ç implies, by (3.101), a decrease in # and hence fewer screening 


electrons at fixed (experimental) values of W (qn). In other words, 
if the screening becomes insufficient the PS formfactor is bent down- 
ward. If the density of screening electrons is insufficient to secure 
the electroneutrality of the Wigner-Seitz cell, a characteristic down- 
ward bend may also be expected in a model potential formfactor. 

Note that were we interested in the influence of the effective me- 
dium on first-principles APW or KARZ formfactors, we should dis- 
place the MT potential as a whole rather than just Æ”. This may be 
seen from the fact that as EF rises from (1/3) Ep to Ef, the average 
potential decreases, that is, rises relative to the MT plateau. This, 
as may be seen from (3.94) and (3.101), is equivalent to a change in 
the MT shift. In a first approximation, raising the MT potential by 
AV is equivalent to taking i, (E) at energy E — AV and not at E. 
Since di,/dF is negative, A, will increase. It follows that the observed 
function A; (EF) is increasing because the computed A, (E) is decreas- 
ing. 

We have confirmed this dependence of the PS formfactor on the 
MT shift in computer calculations. 


Chapter 5 


Pseudism and the secular equations 
of band theory 


5.4. The Green’s function (or KKR) method 


5.1.1. The KKRZ secular equation. The scattering theory pseudo- 
potential introduced in Sec. 2.3 replaces the true potential in the 
evaluation of formfactors. It is also suitable for band structure cal- 
culations. 

To see this, recall that the electronic wave function which satis- 
fies the Bloch boundary condition (1.2) may be expanded in plane 
waves: 





_ Is ik+2, = 
Y= 5, 2 Bie n, (5.1) 

Using the secular equation technique described in the Introduc- 
tion, subsection 5, and taking the Lloyd pseudopotential (2.105) as 
a perturbation, we arrive after using (4.55)-(4.57) at a system of homo- 
geneous equations for the coefficients B,: 

Df (en— E) Bane +E TES, (K+ Ens K+ Bw) | Bar = 0. (5.2) 
< 

This system only yields nontrivial solutions if its determinant 
vanishes: 

f (E, k) = det | (en — E) ôn -+ X T, (E) Sy (k+ Bn» k + gn’) |-=0. (5.3) 

Equation (5.3) holds for any wave vector k; at low energies of the 
order of 1 Ry there are usually several values of Æ; that satisfy (5.3) 
for a given k (it is only these values which give rise to nonzero Bn). 
The set of E; corresponding to a given k is called the energy structure 
for the given point in k space; the Æ; (k) dependence is known as the 
ith dispersion law (or the ith band). The set of dispersion laws gives 
the band structure of the crystal. Equations of the type (5.3) which 
appear as the solubility condition for a homogeneous system are 
called secular equations. 

Equation (5.2) was first derived by Ziman in his paper [70] on the 
multiple scattering of electrons by MT potentials forming a crystal 
lattice. It may be obtained [70] by transforming the secular equation 
of the band theory method of Korringa [367] and Kohn and Rostoker 
[368] developed on the basis of the Green’s function method (see be- 
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low). The secular equation (5.3) is therefore often called the Kor- 
ringa-IX.ohn-Rostoker-Ziman (KKRZ) secular equation. 

Note the simplicity of the procedure we used to derive the KKRZ 
secular equation. We started by considering scattering from an iso- 
lated MT potential. We then chose an appropriate pseudopotential, 
expanded the desired wave function in functions which incorporate 
the crystal periodicity, and arrived at equation (5.2) for the expan- 
sion coefficients. 

Let us consider the properties of the KK RZ secular equation. First 
of all, it satisfies the empty lattice criterion: for vanishingly shallow 
potentials we have A; (Æ) = Jı (E), so that TXKRZ js identically 
zero and (5.6) reduces to the secular equation (1.27): 


det | (en — E) San’ | = 0. (5.4) 


Clearly, solutions of (5.4) are empty lattice dispersion laws. Since 
the KKRZ matrix elements are energy-dependent, it is impossible 
to determine E; by the usual technique of matrix eigenvalues (the 
matrix depends on the eigenvalues to be found). Equation (5.3) is 
therefore solved as follows. 

Considering the determinant of (5.3) to be a function of energy, 
it is evaluated for a sequence of energies, and then the roots of the 
function (that is, its intersections with the energy axis) are found. 
These roots are the energy levels required. 

In transition metal calculations we must be confident that the 
Fermi level in question will fall within the energy range chosen: 
the position of the level can only be determined using the density of 
states, which is itself obtained from the band structure calculation. 
However, the KKRZ method turns out to be unsatisfactory for ener- 
gies well above the Fermi level. The problem is that the KKRZ matrix 
elements involve the function J, (E) œ jj' (xR), that is the KKRZ 
pseudopotential is singular for energies obeying the condition 


j(RV E) =0. (5.5) 


Even though these resonances exist in pseudopotential energy 
dependences, they are totally unphysical and are therefore called “spu- 
rious poles”. The spurious pole nearest the Fermi energy will be for 
l = 0 (the lowest pole possible) for which Æ$ = (n/R)*, R being the 
muffin-tin radius. For an FCC lattice, EU/E% = 8a? (12 x? Z)-/8, 
thus showing that a spurious pole will indeed be observable near the 
Fermi level, especially in metals with large valencies Z. The occur- 
rence of singularities in the KKRZ matrix elements indicates that 
they are not small, even for large reciprocal lattice vectors. This 
means that in the determinant (5.3) large vectors g, must be included 
and the dimensionality of the secular equation becomes too large 
for the problem to be tractable. 
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In this case a singularity in the secular equation matrix elements 
is said to violate the convergence over the reciprocal lattice vectors. 
Furthermore, a singularity of the type (5.5) gives rise to a resonance 
of negative width and hence (see Sec. 4.1.3) to the unphysical (or as 
yet unaccounted for) effect of the merging of band solutions to this 
resonance level, possibly with the appearance of complex £; (k) 
values which indicate the finiteness of the electron lifetime in a given 
state. The occurrence of such states means that the electron must 
leave the crystal in the same manuer as it leaves a (single) MT scat- 
terer with a quasi-bound state (Sections 2.2, 2.3). This, however, is 
prohibited by our earlier assumption of the crystal’s infiniteness: 
the electron simply has nowhere to go. It is for this reason that we 
cousider the poles to be spurious. 

Spurious poles only arise in the KKRZ method; in the OPW or 
APW schemes they do not. Furthermore, if a secular matrix is large 
enough, the poles have no effect on the band solutions, so the APW 
and KK RZ band structures are virtually indistinguishable [346, 347]. 
The same is true of the KKR method (see below): these band struc- 
ture methods yield quantitatively close results.* We may conclude 
that spurious poles, while affecting the computational procedure, 
do not alter the final result. 


5.1.2. The KKR secular equation. The difficulties of the KKRZ 
method can be circumvented by using the KKR secular method. 
This was first done by Ziman [70] with the help of a unitary trans- 
formation. We will resort to a similar transformation which is a non- 
unitary and therefore a simpler one. 

Let us substitute an explicit expression for S,, Eq. (4.57), into 
(5.2). Summing over n’, we obtain 


(En — E) Ba +>: R27 hz, (n) >) hy, (n') Bre =0, (5.6) 


L 


where n in the argument of hz is a shorthand for k + gn. Introducing 
coefficients Cz defined by 


1 . 
— pr lL + De (n) B, =), (5.7) 


and combining (5.6) and (5.7) we find 
(En — E) Ba +>: hz (n) CL =0, (5.8) 
L 


which when substituted in (5.7), gives 
Spr hy (n) hyp (n) 
> -ar 5D ae irs 
L’ n 


* For the same potentials, see [369]. 
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We have thus arrived at a system of homogeneous equations for 
the coefficients Cz. The system will have nonzero solutions if its 
determinant is zero. Introducing 


hy (n) hye (n) 


Gir (k, E)= >) — F (5.9) 
we obtain the secular equation 
Ôr y> 
det | Grr (k, E) -——_*, — |=0 (5.10) 


R? (E)— Ti (E) 
which is the KKR secular equation (or rather it can be made so after 
a little algebra). 
We consider first the matrix G,,-. Clearly, Grr; (k, E) is generated 
from a function G (r, r’): 
G(r, r')= © Gre (k, E)Y, (r) Yx (r). (5.11) 


— 


LL 


Substituting (5.9) in (5.14) yields 
i(k e —i(k+g, )r’ 
| i XI e “ne Pn - 
G(r, ¢)=— G2 aE (5.12) 
n 
Since e, are the “eigenvalues” and Q7! exp li (k + gn) r] are the 
eigenfunctions of the empty lattice model, it follows by definition 
(2.9) that G (r—r’) of (5.12) is the empty lattice Green’s function. 
The involvement of this latter in the KKR secular equation is the 
reason why the KKR approach is frequently called the Green’s func- 
tion method. 


5.1.3. Structure constants. The Green’s function (5.12) may be 
represented in a somewhat different form. We can rewrite (5.12) as 
4 et- r’) 


Ci r= D fokte a r ag 


where the integration is over all reciprocal space. Representing the 
ô-function as 





1 ( were -Qr 
ô (k + gn, — q) = apt | e Eten- Orgar, 


where the integration extends throughout the crystal, we have 


oia- r’) 


G (T, r’) = -T \ | elk -arı È e'f": | gee adr. 
n 


The sum over » is zero for an arbitrary r, and goes to infinity each 
time when r, = t, (cf. the structure factor (1.22)). Taking into account 
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the plane wave normalization, we have 


+ Ue eñn" — => 6 (r, —ty) 
v 
giving 
a iq(r-r’—{,,) 
e 


G (r, r')= — +e Seite dq. (5.13) 


(231)? q7—E 


A similar integral was encountered in expression (2.11) for the 
free-electron Green’s function Go: 








! dq -=G 1 eVES 5.14 

wis | q= (|x |)= —z ESN (5.14) 
Substituting (5.14) into (5.13) yields 

G(r, r) =>) G,(|r—r’ —t, |) ev. (5.15) 


The empty-lattice electron Green's function G (r, r’) isthusa Bloch 
combination of free-electron Green’s functions centered on various 
lattice sites. This is reminiscent of the combination of atomic orbitals 
used in the LCAO method, Eq. (1.15). The empty lattice Green’s func- 
tion is periodic throughout the crystal, as is any function which satis- 
fies the Bloch boundary condition (1.2). 

Making use of expansion (2.36), the matrix elements Gg, can be 
rewritten as 


Gov (lri |r l, E, k) 
= Arr (E, k) ji (r<x) je (r>x) -— “Orn TI (rex) nı (rsx), (5.16) 
where Arr: is defined by the identity: 


drr = (5.17) 


ji ere) Grr ip (ers) 
Thus there are two forms for Azz. The first one follows from (5.9) 

and (5.17): 

1 hy (n) hy, (n) n 


Arr = -—>— — x61 Tr 


hip 4 ELE (5.18) 


The second form can be obtained using (5.15), (5.14), and (2.34)-(2.36): 
Ary = 40S CDr (E, k), (5.19) 
= 
where Cr: are the Gaunt coefficients (2.21) and 


Dy, (Esk) =% Se try (xt) — thr (HL) Vz (bh) ee. (5.20) 
v0 ` 
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More complicated but more rapidly converging expansions for D, 
have been derived in [368, 370-372]. 


3.1.4. Properties of the KKR secular equation. Substituting (5.16) 
in (5.10) and using the identity (2.106) gives the KKR secular equa- 
tion in the form it is generally used: 


det | Apr (E, k) + V E &zr- cot n (E)|=0. (5.21) 


A point to note about this form is the appearance of the phase 
shift n; for MT potential scattering. (The partition (5.16) of the Green’s 
function that gives rise to Azz, is related to some subtleties of the 
Green function formalism which have to do with singularities the Green’s 
function may have at r = r’.) The coefficients Azz can be shown to 
be independent of the choice of r and r’. This can easily be seen from 
expression (5.20) which does not involve r and r’ explicitly. The 
Green’s function (5.9) has thus been divided into regular and irregular 
{atr =r’) parts, one of which is Ay ,-j,j, (all the spatial dependence 
being included in the j,j, product) and the other is xj)n,;. It is easy 
to see that at r = r’ the derivative of the regular part is continuous, 
whereas that of the irregular part is discontinuous (because of the 
linear independence between jp (x) and n; (x)). The irregular part 
was combined with the second term in the secular equation matrix 
element (5.10) and, since this term has no singularities at r = r’, 
the r = 7’ = R limit can be taken in either term of the matrix ele- 
ment. As a result, the secular equation (5.21) involves the range of the 
potential R instead of r, r’ and cot y; is evaluated at r = R. 

Now what is remarkable about the KKR secular equation? 

First of all, the structure and potential parts are separated which 
is reminiscent of the factorized matrix elements of the pseudopotential 
method. The matrix elements A,,- depend on the wave vector k, 
the energy Æ, the lattice type, and lattice parameter a, but do not 
depend on the crystal potential; they are called therefore structure 
constants. 

Let us consider the dependence of the structure constants on the 
lattice parameter. It is appropriate to introduce variables and func- 
tions that are independent of the specific value of a (we designate them 
by waved letters): 

~ ~ ~ a \2 ~ 2 
keik; go=feni ens (E) en E=) E (6.22) 
where &p is the Bohr (or any other fixed) radius. 

Let us substitute (5.22) into (2.33). Since R is the MT (or atomic) 
radius, we have R œ a; |k + gn | R is therefore independent of a. 
Since h, (n) involves Q, we obtain 


hn (n) = (alao)? hy, (n). (5.23) 
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Substituting (5.22) and (5.23) in (5.21) gives 
Arr = (a/a) Arr’, (5.24) 
det | A, (Š, k) + VŽ êrr cotn (E)| =0, (5.25) 


where Azz. is independent of the specific lattice parameter. Given 
the structure constants for one substance, it is a simple matter to 
find them for a substance with different lattice parameter (for the 
same lattice symmetry). 


In practice, instead of the structure constants Azz we evaluate 


the coefficients D, of (5.19), which are less in number. The structure 
constants are rather inconvenient in being dependent on energy; 
this makes it necessary to tabulate them with a very small energy 
spacing. For this reason tables of the KKR structure constants [373, 
374] are only available for a limited number of points in the Brillouin 
zone, namely, for those lying on high symmetry axes. 

The Z convergence of the KKR secular equation (5.21) is exception- 
ally rapid: the first few orbital numbers are sufficient to get steady 
solutions. Jn practice, max is taken to be 2 which means that only s, 
p and d components are included in the Lloyd pseudopotential. This 
is connected with inequality (2.89). Because of this rapid convergence, 
the number of coefficients D; nowhere exceeds 25 ink space, whatever 
the energy. The explanation is that the Gaunt coefficients Cir 
vanish for 1” >l l, i.e., the maximum orbital number hay in 
(5.19) is 4 (if lmaz = lmax = 2) and the total number of terms in 
this equation is (max — 1)?. Some of the other properties of the Gaunt 
coefficients reduce the necessary number of D, still further. 

The way dispersion laws are determined in the KKR method is 
similar to that used in the KK RZ scheme: the determinant is evaluat- 
ed over a certain energy range and then the energies that make it 
zero are determined. The energy dependence of the structure constants 
makes this procedure rather difficult because it is very nonlinear in 
energy; it has singularities when E = e,. We saw, however, that the 
band solutions are pushed away from the empty lattice dispersion 
laws (Sec. 4.1.3). In computer calculations, some measures must 
therefore be taken to get around these singularities. 


5.1.5. Features of the energy band structure. The KKR secular 
equation in the form (5.21) yields information as to the energy band 
structure. In a transition metal, we know that the tangent of the 
phase shift has a resonance in energy (see (2.80)). The simplest way 
to write this down is to neglect nonresonant scattering with phase 1): 


T(E) _ PF 


~ 


Ei—E 
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where £, and I are, respectively, the energy and width of the quasi- 
bound state. For the d band we have a 5 x 5 secular equation (neglect- 
ing for the moment ds and dp hybridization): 


det | (Ea— E) 8mm -+ X Imm: (t,) eX | =0, (5.26) 
v0 
where we have used (5.19) and introduced 


Inm (ty) =T pa Chin, om (Mw (uty) — ijr (xty))Y r (ty). (5.27) 


It can be seen that (5.26) has a typical form of an LCAO secular equa- 
tion. Ea plays the role of an initial atomic level that splits into a 
narrow band, while Z mm’ corresponds to a two-center overlap integral 
responsible for the finite band width. An interesting fact following 
from symmetry considerations [375, 378] is that J, is a sum of the 
LCAO-type contributions known as the ddo, ddn and ddô integrals 
[379, 37]. 

The width [ of the resonant level determines the magnitude of 
the overlap integral Z mm and hence the width of the d band. The larg- 
er T is, the wider the d band. Since l (E) x E!+'/2 we obtain in view 
of (9.22) 


T (Eq) œ E¥/2 œ a. (5.28) 


Since the position of a quasi-bound level in a solid is determined by 
the position of the corresponding bound level in the atom, [ depends 
on the metal’s position in the periodic table in the same manner as 
the atomic level does. The variation in atomic level was illustrated 
in Fig. 3.3. We may conclude that d bands get lower in energy and 
narrower along a period. The exception is middle-period metals: 
their dëst atomic configuration leads to a rise in the atomic level and 
hence to a broadening of the d band. This is illustrated in Fig. 5.1. 

The KKR secular equation (5.25) suggests, to a first approximation, 
that metals having the same crystal lattice have similar band struc- 
tures: the differences may be ascribed to the difference in interatomic 
separations. In higher approximations the difference in crystal po- 
tential should be taken into account. Figure 5.1 illustrates this con- 
clusion too. 

Some conclusions can be drawn from secular equation (5.25) as 
to the way the band structure depends on pressure when a crystal is 
uniformly compressed. It can be seen from (5.22) that the positions 


of all the energy levels change as a~* (E œ Ea). To a first approx- 
imation, the d band width increases with uniform compression as 
the inverse square of the lattice parameter; in the next approximation, 
by (5.28), it is proportional to the inverse fifth power. A numerical 
experiment [380] supports the a~> dependence, first pointed out in 
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{305] from somewhat different considerations. It is also clear that 
pressure will raise all the levels in the crystal and so will increase 
its total energy, which is unfavorable. It follows that increasing pres- 
sure increases the probability for the system to change to another crys- 
tal type, whose energy levels will be arranged in such a way as to 
lower the total energy. 


v Cr Fe 


E,Ry 


Ni Cu 





r x r X r x 


Fig. 5.4. The band structure of BCC (above) and FCC (below) transition metals. 
W is the band width. 


Band theory formalism has thus allowed us, without going into 
mathematical detail, to draw very general conclusions concerning 
the physics of crystals. A similar situation was encountered in Sec. 2.3 
in the discussion of the periodic table in terms of pseudism. The KKR 
approach is a natural generalization of the scattering-theory pseudo- 
potential method to the case of an infinite number of scatterers. 

It would therefore be interesting to derive the KKR secular equa- 
tion by the same method we used for obtaining the phase shifts for 
a single scatterer, namely, by using integral equation (2.4). 
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9.1.6. Green's function formalism. As is the case with an isolated 
potential, an electronic wave function in a crystal may be written 
as a superposition of the radial solutions @,; of Schrédinger’s equa- 
tion. The effect of the other scatterers (which are identical to the given 
one) should be taken into account by subjecting the wave function 
to appropriate boundary conditions. What is appealing about the 
Green’s function representation (2.4) is that, provided the Green’s 
function has been evaluated for the particular boundary conditions, 
these are then directly incorporated into the equation. 

The Green’s function formalism we discuss below is simple to use 
and may be applied to any infinite system of scatterers. The use of 
expression (5.16) allows us to consider both ordered and disordered 
crystals, the periodicity of the lattice not being invoked. It will be un- 
derstood that the only model assumption we make in this section is 
that the potential is finite in range. The KKR technique is simply a 
development of the scattering theory formalism outlined in Sec. 2.1. 

The important point to note is that in an infinite crystal there is 
no incident wave: the crystal fills the whole space and there is no- 
where a (formerly free) electron might come from. The immediate 
implication is that Eq. (2.4) does not contain a solution 9 of the homo- 
geneous equation; its absence is dictated by the boundary conditions. 

The integral equation for ¥ thus takes the form 


¥ (x)= | G(r, 1) V (ry) ¥ (1) ery, (5.29) 


where V is the single-site potential and G is the “empty crystal” 
Green's function, G = $)G, (r — r; — ty). 


y 
On the other hand, as already mentioned, ¥ is made up of solu- 
tions of the radial Schrödinger equation (the weights of this superpo- 
sition have to be determined from (5.29)): 


Y (r) = 2 CR lr) Y, (r). (5.30) 


Substituting (5.11) and (5.30) into (5.29), we find after integrating 
over the angles that 
R 


CLA = > Cy \ Gir (T, ra) V (ra) Av (ra) ridra. (8.34) 
L’ 0 


We shall assume r to be somewhat larger than the range of the poten- 
tial R. Then the integration over r; in (5.31) does not change the form 
of Gr as a function of re or rs (because r < R <r). 

Let us consider the integral in (5.31). Making use of the radial 
equation we write V.#, as a differential operator and then take the 
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integral by parts: 


CrAL= > fr | Grr = Ry (rı) -- Ry (ae Gu |, 


, 


1 @ ô L-41 2 
+\ Fe) aR ta R HE] Grrridr} Cu. (6.32) 


t @r, ri 


Diaa 


Substituting (5.16) into (5.32) we find the integral to be ôr 2 (r = 
R). This means that since in ther —r; — R limit Ž, (r) smoothly 
joins to &, (A), we obtain a system of linear homogeneous equa- 
tions for the unknown quantities C; (at rı = 0 the Wronskian in 


(5.32) is zero): 
5 [Gre (r, r) Go Pr) Rr (r) 3 Grr (r, r) | Cr =0. (5-33) 
< 


By substituting (5.16) into (5.33) and collecting like terms we find 
(Areri (iv Ri — iv Av) + VE reji (nv At — ny Ar) Cr =O. 


nM 


If we introduce a new coefficient @,, viz. 


an = (ji (4R) FA (E, r)— Ri (E, R) FZ h (4R)) Cz (5.34) 


T= 
and take j, out of the summation, we retrieve the earlier derived form: 


> (Arr (E, k) +V E ôr cot n (E)) ar =0. (5.35) 
The condition for the existence of nonzero coefficients a; is the KKR 
secular equation (5.21). The coefficients Cz determined from (5.34) 
and (5.35) must coincide with Cz found from (5.7) because they are 
solutions of the same system of linear homogeneous equations. 

We thus see that scattering theory makes it possible to change 
from a “wave vector” representation (of the NFE type) to a represen- 
tation of “orbital numbers” of a single-site potential (the LCAO 
type). This being so, can we conceive of a model which would be 
opposite to the empty lattice or the NFE model? 


5.1.7. Solitary cell model. If in the LCAO scheme the lattice 
parameter tends to infinity, we get the limiting case of free atoms. 
In the KKR (or Green’s function) method we have imposed a specific 
boundary condition, the absence of any incoming wave. Thus an iso- 
lated scatterer cannot be a limiting case much as in the NFE model 
the limiting case is not a simple plane wave |k) with a dispersion 
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k?, but rather a set of plane waves |k +- g,) with dispersions (k + 
gp)? 

So let us consider the KKR secular equation and take the limit 
as the lattice parameter tends to infinity keeping the range of the 
potential R constant. The resulting model clearly will be an infinite 
number of infinitely separated Wigner-Seitz cells. We shall call it 
the solitary cell model. This is not to be confused with the isolated 
cell model of Sec. 3.4 in which the boundary conditions were taken 
to be those of free electrons. Here we retain the Bloch boundary con- 
ditions, i.e., the Born-Karman conditions. 

What then are the dispersions of a solitary cell model? 

Consider the KKR secular equation (5.10). An increase in a for 
constant R means that |k + g, | R œ a and tends to zero, i.e., 
jı (lk +g, | B) —ôzo and, by (2.33) and (5.9), 


— (4x)? Ôr, oor, 0 1 2 -3 . 
Gu oe, (pe) Ke (5.36) 


which, combined with (5.10) gives the secular equation for the soli- 
tary cell model: 


Tt 


det 





Orr - 
maJ |= (5.37) 
As might be expected, the orbital momentum is the motion inte- 
gral in this model and (5.37) contains 6,,-, that is, as is the case in 
a free atom, there is no mixing of different orbital numbers. For any 
given l there are an infinite number of solutions arising when 


(4, (E) — J (Ey =0. (5.38) 


Equation (5.38) is fulfilled whenever Æ equals the energy of a sin- 
gularity of A, or J, that is, there are two sorts of solution of the soli- 
tary cell model, 


E=, E=E), (5.39) 


where Ef is defined by (5.5) and e; is the energy of the singularity 
of the logarithmic derivative A, related, by (2. 83), to the energy E, 
of a quasi-bound state. 

It may be said that stretching the crystal narrows the energy 
bands and as a — oo they “merge” into the g, level. These, like 
E (k), are stationary (their lifetimes are infinite) and are found from 
the solitary-cell eigenvalue problem, but with the boundary condi- 
tions being taken for the crystal rather than for the atom. These levels 
may thus arise for both negative and positive Eyy. As is the case 
with atomic levels, they are also separated by forbidden gaps; they 
cannot be considered as states located within a continuum of “free- 
electron” levels because these latter have all merged into the levels 
of the solitary cell. 
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Substituting the parametrization (4.80) into (5.37) we obtain* 
a typical eigenvalue problem similar to the empty-lattice secular 
equation (5.4): 

det | (e; — E) ôr |= 0. (5.40) 


We may conclude that scattering theory described the crystal in 
terms of a model which is intermediate between the empty lattice 
and solitary cell models. Depending on what features are the most 
distinctive in the band structure of a crystal, it should be approached 
by either the NFE or LCAO method. 


5.1.8. A model Hamiltonian for the KKR secular equation. It is 
natural to ask whether the models are convertible. The answer is 
definitely yes. Consider Eqs. (5.8) and (5.7) as a system of equations 
for the unknown quantities B, and Cz. In matrix form we have 


(en — E)n hrn B, 


he. _ êw - 1=0. (5.41) 
Ln RT, C, 
Using (4.80) we obtain the secular equation 
act (GP San V Vhia )=0 (5.42) 
V Vihtre (e: — E) ôzr 


which is valid for E close to e;. The upper left-hand diagonal block 
corresponds to the empty lattice model, and the lower right-hand 
one corresponds to the solitary cell model. The two off-diagonal blocks 
describe their hybridization. 

For an infinitely sharp 4, resonance, the resonance width y, and 
thereby the hybridization elements in (5.42) are zero, which turns 
the crystal band structure into a set of empty lattice dispersions com- 
bined with the Æ} levels and the solitary cell levels. A finite resonance 
width implies “interaction” between the two models. 

Equation (5.42) leads to a widely used transition-metal band- 
structure model [70, 73, 3] which is a combination of a wide NFE- 
type E — k? band with the narrow d level (E = £1.2) typical of the 
LCAO model. This gives rise to a 2 x 2 determinant. A solution of 
the arising equation is illustrated in Fig. 4.6. 

Of course, even in such a simple model as this, we must account 
for the five-fold degeneracy of the d level. The resulting determinant 
will then be 6 x 6 and the secular equation may be interpreted in 
the following way. The perturbation introduced by the free-electron 
band (that is, continuum levels) splits the five-fold degenerate €g 


* Near E ~ g, the quantity 7, is small compared with A, and may be neg- 
lected leaving only the singular term, A; — Jı œx (E —6))71. 


13—01063 
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level into levels that start interacting with each other through the 
continuum, thus giving rise to an LCAO band. This is precisely the 
effect we encountered in the LCAO model: the change in boundary 
conditions relative to those occurring in the atom (that is, the appear- 
ance of a continuum) was equivalent there to a perturbation that. 
“smeared out” a level into a band. 

If the resonance in A; is unimportant for orbital numbers other 
than two, it is sensible to retain such terms in the NFE block rather 
than including them into the LCAO-type block. This will result in 
a typical NFE pseudopotential and in the loss of diagonality in 
this block, that is, empty lattice dispersion laws will hybridize in 
the same way they doin s- and p-band metals. The infinite dimension- 
ality of the determinant (resulting from the plane-wave expansion) 
may be reduced by means of the Löwdin convolution procedure as 
discussed in the Introduction. This will bring about correction terms 
similar to the KKR structure constants A;,- (compare the expression 
(5.9) for Gr, with the second-order term in (1.31)). Substituting 
these corrections in the form of lattice sums (as done in (5.26)) we 
will see that in this case the KKR method involves the interaction 
of a typically narrow LCAO band with a typically wide NFE band. 
describable in terms of the pseudopotential method. 

A procedure of this kind was first proposed by Heine [305], and 
was later developed by Hubbard [81, 306, 307], Jacobs [375, 381, 382] 
and Pettifor [376, 383-385]. It greatly simplifies the (nonlinear) 
KKR secular equation by reducing it to an eigenvalue problem. 
Along the same lines, Koelling [3886] treated the APW method. 

Although the-parametrization (4.80) clearly reduces the accuracy 
of solutions, Hleine-Hubbard’s “model Hamiltonian” method (of 
which there are a number of varieties) is quite a reliable tool [387- 
391] for both qualitative and semiquantitative band studies. The 
model Hamiltonian method may be said to transform the singularity 
in the pseudopotential formfactor in such a way as to give rise to a 
resonance level explicitly. This procedure is the reverse to the one 
leading, through Eqs. (4.35)-(4.45), to the Harrison pseudopotential 
(4.32). 


‘9.4.9. The overcompleteness of a basis. There is a great similarity 
between (5.41) and (4.38). Since Eq. (5.41) is entirely equivalent to 
the initial problem in both the KKRZ and KKR methods, it turns 
out that in the Green’s function method the-wave function at r = R 
is expanded in terms of plane waves and #,Y, functions simulta- 
neously: 


¥ 0) =D By [K+ gn) + CLAY: C) (5.43) 


(here we have rewritten the vector-column representation of (5.41)). 
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It should be recalled now that a basis which is complete for all 
space will be overcomplete for a Fourier transformation into a (finite) 
subspace. But the introduction of the pseudopotential was in fact 
equivalent to replacing a part of space by a “black box” whose actual 
properties were irrelevant, provided the true scattering picture was 
not distorted. This means that the wave function inside the box is 
expanded in terms of functions other than plane waves. This leads 
to this part of space being disregarded in a Fourier analysis. But then 
a plane-wave expansion is always performed in pseudopotential theo- 
ry in a certain (coordinate) subspace and hence all pseudopotential 
methods must be overcomplete. 

The KKR method has for a long time not been considered to be 
overcomplete [13]. The treatment above shows, however, that this 
is not the case. Using the theory of model Hamiltonians [70, 305, 
81, 307] we have arrived at a generalized secular equation (5.41) 
and we have shown thereby that since it comes from scattering theo- 
ry, the KKR method is bound to include an overcomplete basis ex- 
pansion. This is clearly illustrated by formula (5.43) in which the 
original plane-wave basis is complemented by a superposition of 
AY type solutions. We thus conclude that all pseudopotential 
band-theory methods are overcomplete and, importantly, the KKR 
method is closely related to the others. 

In the next section we discuss in more detail the pseudism of sec- 
ular equations. We show that what makes the KKR method over- 
complete is the introduction of scattering phase shifts (which is 
equivalent, because of modulo-x indeterminacy, to using pseudopo- 
tentials). 


3.1.10. Lattice of perfectly hard spheres. In conclusion it should be 
noted that when # = e), a solitary cell scatters the same way a per- 
fectly hard sphere does (cf. (2.73), (2.74)). We saw in Sec. 2.4 that 
in pseudopotential theory a perfectly hard sphere is the strongest 
possible perturbation. Suppose we are able to calculate the band 
structure of a crystal whose lattice-site MT-potentials are repre- 
sented by hard spheres. For strong scattering we shall then be able to 
derive pseudopotential-type expressions using as a small perturbation 
parameter the departure of the true potential from that of a perfectly 
hard sphere (this, of course, in the sense of scattering and not in the 
sense of their difference in coordinate space). 

The band structure of such a “crystal” is easily found with KKR 
method. For a perfectly hard sphere, the logarithmic derivative A, 
is infinite for any energy, 


Ti (E) | perfectly hard sphere = © (5.44) 
which, when substituted into (5.10), gives the secular equation 
det | Grr (E, k) |= 0. (5.45) 


13* 
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This scheme varies from the empty lattice model in a somewhat 
different way than does the solitary cell model: the interatomic sep- 
aration is taken to be that of the real crystal while the single-site 
polentials, instead of being infinitely shallow, are assumed to be 
infinitely deep (high). Unlike the empty lattice model, the disper- 
sion laws are determined by energiessatisfying (5.45)—by the Green’s 
function zeroes, as it were—rather than hy the Green’s function poles 
(located at # = e,,). These are characteristic energy levels for the 
crystal structure which are related to the singularities in A,, that is, 
to the energies of the quasi-bound states produced by the potential. 
We might even suggest that there is a direct relationship between 
these energies and the crystal structure because it is only the structure 
that determines the Green’s function in (5.45). This has not yet been 
verified. 

Note also that in the first (diagonal) approximation the solutions 
of (5.45) coincide with the energies of the potential-insensitive points 
of the Brillouin zone (see Sec. 4.1). This supports our idea that cer- 
tain “structure-sensitive” energies are important when forming a 
band picture. 


5.2. Pseudopotential secular equations * 


3.2.1. Boundary conditions on the wave function. Let us return to 
the Schrödinger equation for an isolated scatterer described by the 
potential V (r). This equation can be solved for any energy if the 
wave function is not subject to boundary conditions. If it is, the 
equation is only solvable for certain energy values, the set of which 
forms what is called the spectrum of the problem. 

We saw in the previous section that boundary conditions can be 
incorporated in the problem by writing the Schrödinger equation in 
an integral form and using an appropriate Green’s function. The me- 
thod we present in thissection is more flexible and includes the Green’s 
function formalism as a special case. Moreover, it gives us better 
insight into the meaning of band-theory pseudopotentials. 

Instead of imposing boundary conditions on the wave function V 
itself we may require that they be satisfied by trial functions @,. 
Clearly, a linear combination of such functions will also satisfy 
these conditions. The requirement that the wave function ¥ be ex- 
pandable in terms of p, is therefore equivalent to imposing relevant 
boundary conditions on W. 

In fact, we are already familiar with this procedure. In the NFE 
or LCAO models we expanded the required wave function in appro- 


* The authors are grateful to Professors V. L. Bonch-Bruevich and I. V. Aba- 
renkov for fruitful critisism of the first version of the theory discussed in this 
seclion. 
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priate trial functions, substituted the expansion into the Schrö- 
dinger equation and obtained a secular equation by demanding that 
this wave function should be a solution. 

We may take a different approach. We may express ¥ in terms of 
solutions of the Schrodinger equation for an arbitrary energy and 
then make this expression smoothly join to an expansion in trial 
functions obeying the boundary conditions. Clearly this will also 
be equivalent to imposing boundary conditions on the wave function. 


5.2.2. Matching wave functions. Consider an infinite laitice of 
identical MT scatterers. The wave function scattered by the zeroth 
site is (cf. (5.30)) 


Pout (1) =) CA0, E)Y, (r). (5.46) 


We require that the wave incident on a given MT sphere should ke 
a superposition of waves scattered by all the other spheres, viz., 


Vine(t)= X CLA (l r— tl, E) Y: (r—t), (5.47) 


where t’s indicate the positions of sites and 2, is the solution of the 
radial Schrödinger equation for the lth site. 

If we now require that the wave scattered by the zeroth site smooth- 
ly join to the incident wave at the boundary of the MT sphere, 
we obtain the condition (for r = R) 


Fout = Vines (5.48) 
2.4 
4 Pout = 2 Pines 
r dr 


which must be transformed to the multiple-scattering-theory secu- 
lar equation for determining the energy spectrum and wave functions 
of the problem. The procedure is as follows. 

We start by writing (5.47) for a point r on the MT plateau. Note 
that (5.46) and (5.47) are defined in this region (as is (5.48)) while 
inside the MT sphere (9.47) is not defined. We may rewrite (5.47) 
by adding and subtracting the scattered wave (5.46). This yields 


Pine => 2 CLR ( jr—t |) Y, (r— t)— Pout (r), (5.49) 


where the first term is the one-electron wave function of the crystal 
Y (r), so that 

Wine £) = Y (r) — Fout (r), (5.50) 
which when rewritten as 

Y (r). = Wine (r) + Pout (r) 


expresses the obvious fact that the wave on the MT plateau is a super- 
position of waves coming from all the scatterers. 
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Setting r = R+ and substituting (5.50) into (5.48) yields 
Pout (f_) = VF (Rs) — Pout (R+), 


5.51) 
== + (Y (r) — Fout (r) 


d 


— Y 
dr t| =R r=Ry 


Since ¥ is a Bloch function, that is, Y = et" uy (r) where Up (r) is 
periodic over the direct lattice, it follows that at each point of the 
lattice we may expand ¥ over the reciprocal lattice: 
jy tom i(kt+g,)e er 
To= _, Bre no, (5.52) 
n 

where Q is the unit cell volume. Substituting (5.46) and (5.52) into 
(5.51), using the plane wave expansion (2.32) and multiplying by 
Y, (r) on both sides, we find after integrating over all r directions 


Cr (A(R) + Ri (Re) = 25 Bahi (n, R4), 
(5.53) 





Cr (FM| a + 


d H 
=R_ dr Ml, 


a = J B, = h; (n, R+). 
n 

By a simple scattering-theory argument the function .@, is contin- 
uous at r = R, so we may take the limit R- +R, in the left-hand 
sides of (5.53) assuming R+ kept fixed. It will be understood that 
the two radii are not interchangeable in this context because R4 
has appeared in the right-hand side of (5.53) from (5.49), in which 
W ine is only meaningful outside an MT sphere. Finally, 


201.81 = D2) Bahi (n, R), (5.54a) 
2C Ri =X Bhi (n, R), (5.54b) 


where obvious notations have been dropped and (d/dr) replaced by 
the prime. Dividing (5.54a) by (5.54b) and collecting like terms yields 
Di (hi (n) — hy (n) ^) By = 0, (5.55) 
n 
where à; is the logarithmic derivative familiar from scattering theory 
and all functions are taken at r — R, that is, on the edge of the MT 
potential. This system of equations provides, in principle, the energy 
spectrum of the problem but is not acceptable from the computation- 
al point of view. Incidentally, the system satishes the empty-lattice 
criterion in that, for an infinitely shallow MT potential, the function 
RR, tends to j; (rV E) and if E = en for a certain n, then the deter- 
minant of the system vanishes thus providing a condition from which 
the energy spectrum is determined. 
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What makes the system (5.55) impractical is that its equations 
are written in a mixed representation, i.e., the matrix elements are 
labeled by the quantum numbers n and L. It would be desirable to 
transform the system (5.54) in such a way as to obtain matrix ele- 
ments labeled by identical quantum numbers, for example, by L 
and ZL’ in the orbital momentum representation. Clearly we would 
achieve this goal if we were able to express the coefficients B, in terms 
of C,. There are difficulties, however, which arise from the matrix 
dh, (n) not being reversible. 


5.2.3. The coefficients B,, in terms of Cr. Let us define coefficients 
Zz such that 


1 1 
Bn = z7 X; hr (a, r')ar, (5.56) 
L 





where r’ is different from r, €n = (k + g,)*, and F is a certain energy. 
The idea is that a relation between the a; and C; must be easier to 
find out than that between the B, and Cz. 

Substituting (5.56) into (5.54) and defining (for any t, r’) 


hr (n, r) hy, (n, r’) 


Pir (r, r’) = > ~an F (5.57) 
equations (5.54) become 
2Cr R= 2 Pian, (5.08a) 
2C Ri = D (E rr) avs (5.58b) 
Bp 


where r = Rx while r’ is so far arbitrary. 

The matrix [,,- is easily recognized as the orbital-momentum 
representation of an empty-lattice Green’s function for an electron of 
energy F. It should be noted that F need not coincide with Æ in 
(5.46) and in (5.47). The energy Æ identifies the state we are consid- 
ering whereas / enters our discussion through equation (5.56), which 
thus far only expresses our belief in there being certain a,;’s which 
make (5.96) true. The existence of the a, will be proved if a relation 
between a, and C, is found. If this proof turns out to be independent 
of F, then the a, exist independently of F, and (5.56) holds for any F, 
including F Æ E. 

In order to carry out the proof, let us first find the Wronskian of 
the [';;- matrix. We start with defining a function corresponding 
to lir matrix, 


Tor, r) =Ù Tir (r, re’) Y, (r)Yg (r), 


LL’ 
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which may be rewritten as 


P(r, y= g D SPE e (5.59) 


using (2.32). It will be seen that the last two equations are Eqs. (5.11) 
and (5.12) in which Æ is replaced by F. Using (5.16) we write 


Pie, r) = 2 Cie (fr) Arr (hy F) jv (fr) 
+ fôrse jilfre) mi (Fr) Vie) Yr (r), (5.60) 
where f = + VF, and r< (rs) is the smallest (largest) of the radii r, r’. 
Note that the matrix Azz does not depend on vectors r and r’. 
From (9.59) and (5.60) we find 
Pie = ji (fr) Arre ivy fr’) + frr ji (fr<) r (fs). (5.61) 


Assuming r >r’, which means that r= r and re = r’, we diffe- 
rentiate (5.61) with respect to r to obtain 


d a . . , , 
T Prr = jrr jv + fore ji (fr) ni (fr). (9.62) 


We now multiply (5.61) by j; (fr), (5.62) by jı (fr), and subtract one 
from the other. This yields 


a . d 
ji (fr) Tee — ji (fr) Tr Pre 


= 0-Arrjr + fre ji (fr) (n (fr) ii (fr) — r (Fr) ji G 


Using theorems on the Wronskians of the Bessel and Neumann func- 
tions we find a theorem on the Wronskian of the empty-lattice Green’s 
function, viz. 


. d d 1 opt , = 

Ji (fr) > Pers Pee i (fr) == reji (r f) for rœr', (5.63a) 
; ad. / 
Ji (fr) 2 Pe Te yN (fr) =Q for rar. (5.63b) 


We are now in the position to find a relation between a, and Cr. 
Let us multiply (5.58a) by j; (fR), (5.58b) by jı (fR) and assume r’ < R. 
Subtracting one equation from the other and using (5.63) we obtain 


. ome o 1 a , 
20,7, 2i— jR) = > RP Bros tr’) ay: (5.64) 


L’ 
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from which the required relation is 


2R? f. d , 
soars [i (PR) Fe FE. r) 


ar = 
r r=R 


do, p 
—Sirlir)| pE, RIC, (5.65) 
showing that the a; coefficients do exist and are defined for any F. 


5.2.4. Derivation of the secular equation. Denoting the term in the- 
square brackets as [j,, 2,] we write 


2k? o> 
OL gry Ue Ril Cy. (5.66): 
Using (5.61) and (5.66) we find from (5.58a) 


jı > jvar (Are (hy F)+ sret (Bt) =0, (8.67) 


where (2.30) was used for functions of fr. The condition for there- 
existing nontrivial solutions of (5.67) is the required secular equation 

det | Arr (F, k) +) Fôr cotn (F, E)|=0, (5.68): 
where (y = fR), 


_ —1 h(n) hg. (n) nı (h) 
Arr (F, ‘=F, w SF — fôr 71 W) ; (5.69) 





cot N (F, E) = (ji (Y) — Mji (y) (ni (y) — M (E) ni (y)). (5-70) 


5.2.5. Pseudism in the secular equation. We have thus arrived at. 
phase shifts defined outside the isoenergetic surface. Although 
similar to phase shifts discussed in Sec. 4.5.2, these shifts are of en-- 
tirely different origin. In Sec. 4.5.2 they resulted from the atomic 
sphere approximation whereas in the present formalism they appear: 
as a manifestation of the “structural” pseudopotential and are re- 
lated to the arbitrariness inherent in the transformation (5.56) (or 
the arbitrariness in the choice of the coefficients a,). 

In the atomic sphere approximation (Sec. 3.4.4) the MT radius. 
was taken to be equal to the Wigner-Seitz radius. This is not neces- 
sary in the present context, but the fact that the matching radius is- 
a variational parameter is consistent with the atomic sphere model 
(Sec. 4.5.3). 


With the simplest possible choice for the variational parameter F, 
F=0, (5.74) 
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“we obtain a simple secular equation 


—0 (5.72) 


hy (n)h,- (n) Ôr p? 
, 2 5 Ar (n) hr LL 
det |2 2— a ROIR 


known as the LCMTO or LMTO secular equation (from the method of 
the linear combination of MT orbitals [392-397]) and first obtained 
by Anderson [245] in the atomic sphere approximation for R = R,. 

It is worthwhile discussing here the physical meaning of the KKR 
‘secular equation (5.68). Introducing, formally, 


Ory’ = — arc tan [+ Ary (F, k) | 
(5.73) 
(5.68) takes the form 
det |cot Prr — Sz, cot n; | = 0. 
(5.74) 


This equation allows a simple 
interpretation (see Fig. 5.2). The 
effective medium around a given 





Fig. 5.2. Schematic illustration of 

-a scattering process as described in 

terms of pseudopotential secular equa- 

tions. The MT “black box” is sur- 

‘rounded by a “black box” represent- 
ing the rest of the crystal. 


MT potential (a crystal lattice 
in our case) is considered as 
a scatterer. Since the phase shift 
matrix zz is nondiagonal, the 
scatterer is nonspherical. The MT 


potential in this case turns out 
to be a spherically symmetric scatterer. The requirement that the 
electron scattered by one of the two scatterers should be effectively 
ignored by the other scatterer (the requirement for the coincidence 
„of phase shifts, Eq. (5.74)) secures the stationary motion of the elec- 
tron throughout the crystal. 

In other words, the MT potential “black box” is surrounded by the 
“black box” of the lattice pseudopotential. If the phase shifts are 
-equal on their “interface”, the electron moves freely between the two. 
It is this steady-state condition which determines (by (9.74)) the 
band solution Æ (k). Note that according to Eq. (5.74) the phase 
shifts must only coincide modulo x. 

In the context of the discussion of Sec. 3.5 it may be said that the 
variational parameter F (a property of the lattice pseudopotential 
in (5.69)) and the average potential outside the MT sphere (a prop- 
‘erty of the effective medium, see Sec. 4.5) are related similarly to 
(4.104): 

F=E—V, (5.75) 


-where E is measured from the MT zero, i.e., E is Emr. 
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It is against the background of this effective medium that scatter- 
ing processes evolve. A different V, corresponds to a different “back- 
ground” space. The nearer the properties of the space to those of 
the MT potential, the smaller is the phase shift due to the MT po- 
tential, i.e., the weaker the MT pseudopotential. 

Using equations (5.6)-(5.10) and (5.16)-(5.21), the KKR-type sec- 
ular equation (5.68) can be transformed to a KKRZ-type one giv- 
ing the generalization of the KK RZ method (see (4.113), (4.63)): 


det | (en —F) 6m +X T (E, F) Sp (k-+ gn, K+Gn’)|=0. (5.76) 
L 


This equation should be solved for Æ. Applying the Löwdin convolu- 
tion procedure to (5.76) we obtain (4.117). 

It is interesting to note that the wave-vector representation can 
also be derived from the exact secular equation (5.55). To show this 
we first perform the summation in (5.55) with,a weighting factor 
hy (n’) to obtain 


>) (nn — Brn’) Bar = 0, (5.77) 


’ 


where 


ant = Dy hr (n) Môrrhre (2!) =e D M(E) Si (m, w), (5-78) 
L 


LL’ 


with S; defined by (4.57) (as in (5.76)) and 
Ann’ = 2 hy (n) hy (n'). (5.79) 


Using Eqs. (5.77)-(5.79) we finally arrive at the secular equation in 
the wave vector representation: 


det | È F, (E, en) Sr(k+gn, k+gr)|=0. (5.80) 


This equation is reminiscent of the APW secular equation ((1.19) 
combined with (4.62)) which may be rewritten as 





4nR2 jill En—8p | R) ) 


det | (en — B) (önn —- p Tg eg 


+) T(E, en) Si (n, 2’) | =0, (5.81) 
L 
where R is the range of the potential, Q, the volume of the Wigner- 


Seitz cell, and T, is given by (4.113) outside the isoenergetic surface: 
{as in (5.76)). 
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In the atomic sphere approximation we know that 
4nR?2 jill Zn— 8gp | R) 
Qo | 8n— 8p | 


(see Sec. 4.5.3, Eq. (4.118)). The only remaining term in the APW 
secular equation (5.81) is then the last one and (5.81) coincides with 
(5.80). It is also clear that if F is formally set equal to £, in the KKRZ- 
type secular equation (5.79), then all the three secular equations 
will be the same. It should be noted that within the APW method, 
the secular equation (5.80) can only be obtained in the atomic sphere 
approximation, that is, by approximating the potential, within the 
generalized KKRZ method using a specially chosen X matrix, 
(9.80) is obtained, as it were*, by the choice of the variational para- 
meter F; and finally, we have seen that (5.80) can be derived exactly 
(Eqs. (5.77)-(5.80)). This is reminiscent of the Austin-Heine-Sham 
theorem (Sec. 2.4.3) according to which there are more than one pseu- 
dopotentials by which one and the same scatterer may be described. 

To summarize, the APW and KKRZ(KKR) methods devised for 
band structure studies employ different pseudopotentials; different 
trial functions imply different “lattice” pseudopotentials, i.e., differ- 
ent effective mediums. It is for this reason that, even though the 
methods converge differently, the band structures they lead to are 
the same. 





— Bain 


* The explicit form of [, r, (5.57) and the Green's function formalism as a 


whole (which led to (5.68)) do not allow us to perform the transformation for 
F = ¢,. The similarity is therefore only formal. 


Part 2 


The use of pseudopotential theory 
for crystal-structure calculations 


Chapter 6 


Formalism of crystal-structure 
energy calculations 


6.1. Basic assumptions 


The total energy of pure metals and binary alloys has been a sub- 
ject of many pseudopotential studies and general formulae for cal- 
culating it have been obtained by many investigators even though 
the underlying assumptions are essentially the same. A crystal is 
represented as a system of N periodically arranged positive ions (of 
one or two species) immersed in an electron gas. For the (more gene- 


ral) case of an alloy, the density of the gas is pọ == Z/Q, where Q 


is the average atomic volume of the alloy and Z = (1 —c)Z, + cZ, 
is the average atomic valency, Z4, Zg being the valencies of the A 
and B components and (1 — c) and c their respective atomic concen- 
trations. For a pure crystal of, say, A atoms, cis zero and pọ = Z,/Q,, 
as it must. 

The total energy of a crystal may be regarded as the sum of two 
contributions, one of which depends only on the volume, and the 
other, formally, only on the structure. The former, U,, includes 
terms which are independent of the atomic distribution, namely, 
the kinetic energy of free electrons, the exchange-correlation energy, 
etc. [17, 73]. The structure-dependent energy is the sum of the elec- 
trostatic energy Ues and the band energy Ups. An expression for Up 
may be found elsewhere [17, 73]; the derivation of U.s and Ups is 
given in this chapter. 


6.2. Band structure energy of pure metals 
and binary alloys 


The first calculations of an alloy’s band energy were attempted 
in [398] for a nonlocal and in [399] for local potential. The latter case 
is easier to handle and will be discussed first. The discussion is mainly 
based on [400-404]. 
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We showed in the Introduction that second-order perturbation 
theory approximates the band energy to* 
Q ‘ aad 1—e (q) 

Unge D PIWO) PS (6.1) 
where #8. (q) is the formfactor of the crystal pseudopotential. 
In order to obtain Uy, explicitly, the crystal pseudopotential must 
be expressed either in terms of the ion pseudopotentials or their 
formfactors. Recall that 7è (r) and its Fourier transform are re- 
lated by 


We(Qa= \ eTit IG, (r) ek" dr. (6.2) 
Next we represent 7°} (r) in terms of ion pseudopotentials 


GI 1 5.3 
Wve (r) =F D>) Won (— tvu)» (6.3) 


vie 


where v labels the lattice site and u indicates the species of the ion. 

In the following treatment, we shall assume the ions of a crystal 
to occupy all (and only) the lattice sites. In a real problem, this per- 
fect order may be violated by thermal vibrations, lattice defects, or 
the difference between the atomic radii of the components. These 
effects can, in principle, be included and will be discussed later. 

We shall consider, for generality, the case of an alloy; pure-metal 
expressions will be obtained in the limit of zero impurity concen- 
tration. 

Let the quantity c, be unity or zero depending on whether the 
yth site is occupied by a B or, respectively, an A atom**. 7/8, (r) 
then takes the form 


Ws (t= Dy (ey) W4 Ot) Fey Waet) (6.4) 


Obviously, the summation in (6.4) is over all v. Substituting (6.4) 
into (6.2) we obtain 


W's (q) = Dy (Ley) Wa (q) ev +e Wh (q)e 9}. (6.5) 


Vv 


In this expression, we can separate out the average value °? (q) 
by adding and subtracting, at each site, the averaged Fourier trans- 


* Throughout this part of the book, the pseudopotential of an (unscreened) 
ion is labeled with the superscript b for bare. 

** It would be more proper to speak of ions. Traditionally, however, lattice 
sites are said to be occupied by atoms. 
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form of the alloy atomic pseudopotential. We have 
wyla =i DW (ge 41 D (Cyc) AW? (q) em". (6.6) 
Vy y 
Here o 
W? (q) = cW3 (a) + (1 — c) WA (0), (6.7): 
AW? (q) = W5 (q)— Wa (a). (6.8) 


Since W® (q) and AW? (q) are independent of v, they can each be 
taken out of its respective sum. The remaining expressions will be 
denoted 


1 -i 
N > e a S (q), 


1 -i 
= D (Cy —c) ev = C (q). (6.9): 
In our case of a perfectly periodic infinite lattice, 
S (q) = Ôq, sp? (6.10) 
where g, is any (nth) reciprocal lattice vector. At the same time 
1 -i 
C(q=£n) =~ J) (Cy—e) enw =x 0 (6.11) 
so that 
S (q) C (q) =0. (6.12): 


As a result, 


| We. (q) 12 = S (q) S* (q) | W? (q) 2 LC (q) C* (q) | AW? (q) |? (6.13) 
which, when substituted in (6.1), yields 


Q w’ a 1 Tb 
Us= gry FSM 5 (a) S* (0) | W (a)? 


" + € (a) ©* (q) |AW*(q) 2). (6-14) 


Of the two terms in (6.14), the first one is proportional to the squared 
average formfactor of the alloy atoms (in a pure crystal, to the 
squared formfactor of the constituent atom); the second term is only 
meaningful for alloys. The first term may be written 








, {— —__— 
U=Z Sy a AE Wa) be, 
q 


Q 2 1— n Wb 
=; 2) gh See |W (en) 12. (6.15) 
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It is thus reduced to a sum over reciprocal lattice vectors (except 
-g, == 0) and is therefore only determined by the (discrete) values the 
functions e (q), e* (q) and W (q) take on at q = g,. If the energy 
-of an alloy depends on this term alone, then when an alloy is being 
-considered instead of a pure crystal it is enough to replace Q and 


W (q) by Q and W (q) respectively. It should however be remembered 
that the differences in interatomic separation and average valency 
between pure crystals and alloys may alter the screening functions 
and formfactors of the constituent atoms. To take account of this, 
the formfactor of an alloy atom, W, (q), is often expressed in terms 


-of the formfactor of the pure-metal ion, wire (q), 


(u) 
W, (a) => 2) pyre (q), (6.16) 
Qe (q) 
where Q and ge, (q) are respectively the specific atomic volume 
and the dielectric permittivity of an atom of the p species as a pure 
‘component. There is no entire agreement, however, as to the value 
-of this approach as many workers consider (6.16) to be oversimplified. 
The next question to be examined is the crystal-structure stability 
‘from the point of view of the function UÑ. It is convenient to re- 
‘write this function as 


, — 5 , 
U=), Wgn) |? BE =D) Oys(En)- (6-47) 
n n 
‘The quantity D,, (£n) is called the (band structure) characteristic 

function. 

It follows from (6.17) that a crystal structure is the more stable 
the larger the magnitude of Pps (£n) at the points corresponding 
to the vectors g, of the structure (recall that Dy, (q) < 0 because 
x (q) < 0). The stability of a given structure thus depends on the 
properties of the function Pys (gn). 

The behavior of this function as a whole is determined by the 
«q-dependences of the functions it is composed of. In this section we 
will therefore discuss the behavior of the dielectric permeability 
£ (q), the Lindhard function y (q), and a typical pseudopotential 
W (q) and its absolute square. 

We first consider the function y (g). In Fig. 6.1 we plot this func- 
tion, as given by the Fermi-sphere approximation, 


-82 1—7? in reL] 

d=- [1+ me in| |: (6.18) 
where n = g/2kp. It can be seen that y (q) is everywhere negative 
and its absolute magnitude is a maximum at q = 0, and y (0) = 
(—3/8) Z/Ey. The function changes fastest in the vicinity of 
q = 2k,y, where the term containing (1 — n) ln | 1 — n | gives rise 





Ch. 6. Formalism of crystal-structure energy calculations 209 


to a relatively weak singularity of the type 0 x In 0. The singularity 

becomes more pronounced in the derivatives: it is of the type In 0 

in the first and O07! in the second derivative. For q larger than 2kp, 

| y (q) | becomes smal] and so 

does Dy, (q). x(a) 
The dielectric permittivity 0 

e(q) is everywhere positive 

(Fig. 6.2), diverges at q +0 as 

q~ and rapidly approaches unity —0.5 

with increasing q. Its singularity 

at q œ~ 2kp is similar to that in 

x (9). =1.0 
The behavior of e (q) is very 

similar to that of e* (q), the only Fig. 6.1. Lindhard function. 

difference being that the factor 

f (q) [146] makes e* (q) approach 

unity more rapidly with increasing q. All in all, because of the rapid 

decrease of | y (g)/e* (q) | with q, we see that Ups is dominated by 

Dis (q) for q not appreciably exceeding 2k;. We must, however, add 


0.5 1.0 1.5 q/2kep 


é(q) 


2.2 


1.0 qQ/2Kkp 
0 0.5 1 1.5 2 


Fig. 6.2. Dielectric permittivity as a function of q. 


a note of caution: the ®ps (q) for q > 2kp cannot be neglected until 
after a careful analysis of the convergence properties of Ups; other- 
wise the resulting value of Ups may be totally incorrect. 

The next function to be examined is W (q). We have shown that 
in principle a local screened pseudopotential tends to —(2/3) Ef 
as q ~0. The magnitude of W (q) increases with g, passes through 
zero at q ~ 0.8 x 2kp (usually denoted as go), reaches a maximum 
atq ~ 1.2 x 2kr, and then decreases to zero. For some types of pseu- 
dopotential formfactor, oscillations may occur at relatively large q. 


14—01063 
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They are usually eliminated by the damping factor D (q) = 
= exp (—0.03 (q/2kp¥}) [405]. 

It can be seen from (6.17) that the values of q which determine the 
| W (q) |? in the band structure energy correspond to reciprocal lat- 
tice vectors for which S (q) is different from zero. Hence as far as 
stability of a crystal structure is concerned, we are mainly interested 
in | W (g,) I for the first few reciprocal lattice vectors of the struc- 
ture under study. 

It will be convenient in the following to choose unit length for 
q to be 2n/R,, Ra being the radius of the Wigner-Seitz sphere. In 
this scale, the BCC, FCC and HCP structures have the same value 
of 2kp, namely, (9Z/4n?)1/3, Z being the ionic charge. This means 
that for Z = 1, 2, 3, 4 the quantity 2k, is, respectively, 0.611, 
0.770, 0.881 and 0.970. The lengths of the first reciprocal lattice 
vectors are in this scale (i) 0.696 and 0.895 for the BCC vectors (110) 
and (200); (ii) 0.677, 0.782, and 1.105, for the FCC vectors (111), 
(200) and (220); (iii) 0.638, 0.677, 0.722 and 0.930 for the HCP vec- 
tors (100), (002), (011), and (102). 

Comparing these lengths with 2k; shows that for the above close- 
packed structures, they are in the range between 1 and 3kp. The 
point qo (œ1.6kp) also falls within this range. In pure crystals, the 
low q region (where the singularity in e (q) occurs) does not contain 
the g, vectors and therefore has little effect on stability. 

There are a number of factors which may affect quantitative re- 
sults. These are the properties of the pseudopotentials and their 
formfactors, the manner in which s-d hybridization is included (in 
the transition, noble, alkali earth and other metals), exchange-cor- 
relation effects, etc. The way in which the particular result has been 
obtained will be described in some detail in future discussions of 
the energetics of a specific material. 

Now that we have obtained some insight into the behavior of the 
characteristic function as a whole, we are ready to discuss the main 
factors which determine the structural stability of pure crystals. 
We begin with an analysis of the function’s behavior in q space. 
In doing so we use aluminum as an example because it was one of 
the first materials to have been studied by the pseudopotential meth- 
od. In Fig. 6.3 we plot the function ®,, (q) for Al together with 
the values of |g, | which correspond to the BCC, FCC and HCP 
structures. 

Since the values of Pps (q) for q œ= qo contribute very little to 
Ups, the first vectors in the BCC and FCC structures are of little im- 
portance, as are ithe first two in the HCP structure. As for the rest 
the largest contribution to Dy, (q) comes from the (200) vector of 
the FCC structure, which makes this the most stable structure for 
aluminum, in full agreement with experiment. We thus see that, to 
predict qualitatively the structural stability of pure crystals, the 
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relative positions of g, and the first few reciprocal lattice vectors 
must be studied for the major crystal structures. A structure will 
not arise if its first vectors are close to gọ It will have a certain 
amount of stability if at least some of the corresponding reciprocal 
lattice vectors are lower in magnitude than 2kp, because for q œ> 2kp 
the quantity | y (g)/e* (q) | is relatively small. 

A general formulation of this rule might be: structures for which 
there are a large number of reciprocal lattice vectors whose magni- 
tudes are less than 2k, and at the same time are sufficiently different 


Pa (g) 


~0.05 





Fig. 6.3. Characteristic function Ppg (q) for Al, 


from qọ are stable. The rule is confirmed by a great many examples 
and even holds for the distorted crystal structures that gallium and 
mercury adopt. 

The rule explains the stability of the structures certain pure me- 
tals and semiconductors adopt and may be useful for preliminary 
analysis of relevant data. It will be realized, however, that a complete 
explanation of stability would require a total energy calculation in- 
cluding the electrostatic interaction and the contribution from the 
characteristic function for many reciprocal vectors. To illustrate this 
we again take aluminum as an example. 

The band, electrostatic, and total energies of aluminum were cal- 
culated in [154]. The band energy was calculated by summing over 
more than a hundred reciprocal lattice vectors. Table 6.1 presents 
these band, electrostatic, and total energies in units of Z°/2R,. 

It can be seen that, judging by the electrostatic encrgy alone, the 
most stable structure is BCC, while the least stable are the hexagonal 
structures whose axial c/a ratios differ from the ideal value, which 
is 1.633. If only band energy is considered for only the BCC, FCC 
and HCP (c/a = 1.633) structures then BCC has the lowest stability 


14% 
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and FCC the highest (this also follows from the relative positions of 
gn and gy in these structures). lt should be noted that these structures 
differ more in band energy than in electrostatic energy, so that it is 
in fact the band energy contribution that determines the sequence 
of structures in terms of total energy. 


Table 6.1. The parameters Ggg. Gyo) Fota, for Al 

















Structure | c/a | -ats | —Ahs -đ@total 
BCC 1 1.79186 0.01247 1.80433 
FCC 4 1.79172 ** 0.01414 1.80585 
Hexagonal 1.5 1.78998 0.01489 1.80487 
Hexagonal 1.6 1.79156 0.01395 1.80551 
HCP 1.633 1.79168 0.01390 1.80558 
Hexagonal 1.7 1.79129 0.01420 1.80553 
Hexagonal 1.8 1.78909 0.01583 1.80492 
Hexagonal 1.9 1.78497 0.01874 4.80370 


* Note that Chas is the same for all materials having the same structure as that as- 


sumed for Al. 
** The latest data is 1.79175. 





It would be wrong, however, to deduce from this that the electro- 
static energy is not important for stability. Calculations show that, 
from the energy band point of view, the hexagonal structure, whose 
axial c/a ratio is far from ideal, is most favored. The departure of 
a hexagonal structure from close packing leads to an increase in Ues, 
which compensates for the band structure effects in such a way that 
the total energy turns out to be lowest for FCC (which is indeed ob- 
served). 

This shows, incidentally, that Pps (g,) for a few first g, are insuf- 
ficient for a stability analysis, and U,, for a large number of g, must 
be added. 

Thus the minimum total energy, or the stability condition for a 
particular crystal structure, comes as a result of an interplay be- 
tween the band and electrostatic contributions, and generally no 
single factor may be claimed to uniquely determine the actual struc- 
ture. There are however factors which may be considered to be re- 
sponsible, to some degree, for certain properties of a crystal structure. 
For example, the electrostatic energy contribution secures the forma- 
tion of close-packed structures, and the choice of the most stable of 
these is determined by the band energy (and, in particular, by the 
rule that the | g, | must be less than 2k, and lie away from q,). 
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The occurrence of distorted structures in metals cannot be caused 
by the electrostatic contribution, but is related to the behavior of 
the characteristic function and is, as a rule, due to gy being close to 
the positions of g, in close-packed structures. The behavior of the 
characteristic function is determined, in this context, by the form 
of pseudopotential. 

It is worthwhile discussing how the structure is affected by the 
structure-independent contribution to the total energy. This might 
sound paradoxical considering our statement at the beginning of 
this chapter that this contribution is only determined by the free 
electron gas and has no effect on the arrangement of the atoms. Actual- 
ly, the statement was only partially true because it is the structure- 
independent contribution which is responsible for most (90%) of 
the interatomic separations in a crystal, the structure-dependent con- 
tribution to the total energy controlling the remaining 10%. The 
energy U, thus approximately determines the interatomic separation 
in a metal of a given valency and hence the geometry of the g,, vec- 
tors of the various possible crystal structures. Depending on their 
positions relative to gy (which depends on the behavior of the form- 
factor of the pseudopotential, which is determined, in turn, by the 
electron-ion and electron-electron interactions), a specific structure 
is preferred. Thus the structure-dependent contribution determines 
the arrangement of atoms at approximately constant volume. It 
is important to realize that the division of the energy into the struc- 
ture-dependent and structure-independent parts is to some extent 
arbitrary. 

Thus far we have discussed the behavior of the energy contribu- 
tions in reciprocal q space. Any function, however, may be described 
in both reciprocal and direct spaces, and it is of interest to repeat our 
analysis for a direct-space representation of the energy. 

The corresponding expressions for the band and total energies may 
be found by using the potentials of indirect and total interatomic in- 
teraction, respectively ys (r) and ® (r). These were defined in the 
Introduction as 


2Q i S 
Pys (1) =e \ Dys (q) ei" dq, 
(Ze)? (6.49) 
D (r) = + Pys (1). 
The band energy may be expressed as 
4 z , 
Uys =37 D) Di Py(lr l) (6.20) 


3 7 


At large r, the function Pys (r) describes an attractive Coulomb po- 
tential. This can be understood by remembering that at small q, 
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Drs (q) x q™> and that the Fourier transform to real space for func- 
tions of this type behaves as r~t at large distances. 

An interesting feature of Pps (r) and @ (r) is their nonmonotonic 
oscillatory behavior with increasing r; at large r they take the form 
of Friedel oscillations of the type (kpr) cos 2k,r. These originate 
[89] from the strong singularities of the type (1 — )~1 occurring at 
q œ 2kp in the second derivative of the characteristic function. 

The function ® (r) also oscillates at moderate r’s but the oscilla- 
tions are generally determined both by singularities in e (q) and by 
the behavior of the pseudopotential formfactors. In this region, the 
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Fig. 6.4. Interatomic potential Ð (r) for Al. Bold line—pseudopotential calcu- 
lation, dashed line—asymptotic approximation. 


position of the first minimum of ® (r) which, in principle, must be 
related to the nearest-neighbor distance of the crystal is of interest. 
Figure 6.4 shows the ® (r) for aluminum calculated for the first few 
coordination spheres. It can be seen that the position of the minimum 
does indeed correspond to the nearest-neighbor distance. 

The reader is reminded, however, that the interatomic separation 
is determined by both the structure-dependent contribution to the 
total energy and the structure-independent contribution. The func- 
tion ® (r) has been obtained from the former and should therefore be 
corrected by a term related to the latter. As this will include a rather 
strong interatomic interaction, the separation may be shifted by the 
correction from a minimum of the potential to its maximum, which 
is related to the structure and thereby to atomic redistributions. The 
energy of the crystal may then be reduced and because some of the 
atoms will be closer together and some farther apart, the “ideal” 
structure will in fact be distorted. This is indeed the mechanism re- 
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sponsible for the distortions in the crystal structures of gallium and 
mercury. 

The topic of stability may thus be approached equally in terms 
of direct and reciprocal space. From the purely mathematical view- 
point, the direct approach is less convenient because a great Many 
terms each of which oscillates with the number of coordination sphere, 
must be summed. The direct space is however often better suited for 
a qualitative analysis and yields more insight into the physics of the 
results. 

Although these arguments are £ 
widely used for the stability 
analysis of crystal structures, the 
theory is being developed in sev- 
eral ways, e.g. improvements in 









the exchange and correlation cor- N 
rections, and searches for more Exact <p, 
effective pseudopotentials to in- solution `> fn 


clude nonlocality, d and f reso- \ 
nances, and sd hybridization [3]. 

We wish now to address an- 
other topic of general importance, 
namely, the crucial relationship 


between the Fermi sphere and the 


Perturbation 
theory 


Brillouin zone boundary. When 0 2/2 A 
these touch we have o” 
2kp =n, (6.21) Fig. 6.5. Dispersion law Æ (k). 


and the denominator of the second-order energy term goes to zero. 
According to Jones [406, 9], the resulting singularity in e (q) mini- 
mizes Ups and is therefore the reason for the phase transitions which 
occur at the (critical) electron concentrations satisfying (6.21); the 
electron concentrations are governed by the Hume-Rothery rules. 
The second-order perturbation theory result for the dispersion £ (k) 
is 
E(k) = k? (k |W [ky + >) ete WIT (6.22) 
n wy (kK? —|k+ gn?) 


It can be seen from Fig. 6.5 that E (k) as given by (6.22) has a 
singularity in the neighborhood of the energy gap. In a two-wave 
model, the exact expression for Æ (k) is found near the Brillouin zone 
boundary from the secular equation 


4 
ke Ek) Wa, 
=0 (6.23) 
We, Hk + Bn [?— E(k) 
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Fig. 6.6. (a) Dispersion law, (b) density of states, (c) the departure Ups of the to- 
tal energy from the free-electron value near the Brillouin zone boundary. Bold 
lines—exact solution, dashed lines—second-order perturbation theory, dotted 
lines—free-electron gas. The vertical lines correspond to the condition g, = 


= 2kp. 
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and its explicit form is 
4 4 > 1/2 
E (k)= 5 {k+ | ktg l| k kgn 24W Yo 
(6.24} 


where Wg, = (k + gn | W |k). Figure 6.5 also shows E(k) ‘as 
given by (6.24). Comparing (6.22) and (6.24) indicates that the sec- 
ond-order approximation for Æ (k) is unsatisfactory near the Bril- 
louin zone boundary. 


Ups 
0.5 \ 


2.0 q/2kp 





Fig. 6.7. Ups plotted as a function of g/2k, for a single Brillouin zone face. The 
summation is over the entire Fermi sphere (bold line) and over the cone of solid 
angle 4n/N (dash line). 


The density of states in a metal and the corresponding total energy 
(per ion) have been exactly calculated in [407]. These formulae are 
too cumbersome to reproduce, but Figure 6.6 shows F (k), n (E), 
and Ups calculated both from these formulae and from second-order 
perturbation theory, Eq. (6.22). Although Æ (k) and r (E) differ 
substantially in these two cases, the two Ups plots as functions of 
the number of electrons only slightly differ and in neither case is 
there a minimum at g, œ 2kp. The results of [407] are significant 
for a number of reasons. 

First, they disprove Jones’s idea [406] that band energy minima 
occur at points where the Fermi sphere (surface) touches the boundary 
of the Brillouin zone; his interpretation of the Hume-Rothery rules 
is therefore devalued. According to [407], Jones’s results can be ex- 
plained by the way he treated the summations in the total-energy 
calculations; namely, instead of summing over the entire Fermi sur- 
face, Jones only considered a cone of solid angle 4x/N,, Np being 
the number of equivalent faces of the Brillouin zone. As a resull, the 
energy-gap effect was largely ignored (see Fig. 6.7). 
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Second, it was predicted that the total energy varies rapidly around 
Ln œ 2k» due to the square-root-type van Hove singularities in the 
density-of-state function; these lead to discontinuities in the third 
and higher derivatives: 


BU ps B 1 dn (E) oe 
dP |Ep  n(Ep> (“E ) ip: (6.25) 





‘These variations may reduce the structural stability of a phase for 
which gna happens to be equal to 2kp. However, since Ups decreases 
very smoothly with Z, no strict 
criterion may be suggested by 
which the stability range of 
a particular crystal structure can 
be found. It should be remarked 
though that a transition is most 
likely at g, œ 2kp, where Ubs 
varies somewhat more rapidly 
than elsewhere [419]. 

Finally, it turns out that the 
band-structure energy may in 
many cases be calculated by com- 
bining second-order perturbation 
Fig. 6.8. The regions contributing to theory, Eq. (6.22), with the Fermi 
the total energy. The “+” and sphere approximation. According 

— ” regions cancel out. to [407] the reason for this is that 

the relevant summations are over 

occupied states lying both below and above the energy gap. Since 

E (k) in either region are symmetric in k, the corresponding con- 

tributions essentially cancel each other out and the resulting 

error turns out to be very small. This is clearly illustrated in 
Fig. 6.8. 

Sometimes, however, the interpretation of an experiment depends 
on Ups difference (for different phases) rather than on the Ups values 
themselves. In this case, even a small discrepancy between the exact 
and second-order results may prove crucial, which is indeed the case 
1276, 408, 409] so that higher-order terms are needed. It would be 
interesting of course to study the higher-order contributions in a 
manner similar to that of [407]. It also seems necessary to investigate 
if the above conclusions retain their validity for nonlocal OPW 
calculations. 

Thus far we have only been concerned with the UR term, so, strict- 
ly speaking, our considerations are only valid for a stability anal- 
ysis of pure crystals. Nevertheless, the energetics of alloys have 
also been treated in this manner, especially in the earlier papers. 
This approach is called in this context the virtual (or average) crystal 
approximation. A more correct total energy calculation must certain- 
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ly include the term U% which is 


== See c@cr(@ |Awe(q) 2. (6-20) 
q 

The difference between the Fourier transforms of the potentials of 
the components must generally be less than their average value (for 
the same q). This appears to be a sufficient argument for neglecting 
U. But it isnot. The U{? term is a sum over the q’s which are equal 
to “the reciprocal lattice vectors. The summation in (6.26) is over the 
continuum of q values including the relatively small ones for which 
the formfactors W} (q), W$ (q) may be considerably larger than 
W? (g,), W% (gn). Hence the difference between the pseudopoten- 
tial formfactors of the components may be comparable with (and 
perhaps even larger than) the average formfactor. Neglecting the 
second contribution in (6.14) thus becomes very questionable. 

We. will obtain various explicit expressions for (i) complete dis- 
order, (ii) complete order, and (iii) short-range order depending on 
the manner in which atoms of different species are distributed over 
lattice sites. 

In the first case we deal with a disordered alloy, by which we mean, 
here and in the following, that it has no long-range order. Thus its 
atoms are distributed over the sites either completely at random or 
with some degree of short-range order. For such an alloy we have, by 


(6.9), 
C (q) C* (q) = y m È 2 (ey — c) (ey —e)) eT tvv, (6.27) 


Since c, = 1 only if the vth site is occupied by a B atom, 
CyCy = cPiv', (6.28) 


where PPZ, is the probability of finding the vth and v’th sites simul- 
taneously occupied by B atoms. Substituting (6.28) into (6.27) and 


denoting t, — ty = p; we obtain 
C (g) = D>) (PP —c2) es. (6.29) 


i 


In arriving at this result we utilized the fact that PŽ. decreases 
rapidly with p;; the double sum is therefore replaced by a single 
one and hence the factor 1/N. 

Using the total probability theorem 


PPP 4 pPA | (6.30) 


220 Part 2. Pseudopotential theory in crystal-structure calculations 


and introducing a short-range order parameter [411] 





p24 
aj=1—7— » (6.31) 
we find 
|C (q) =E J a (oie, (6.32) 
Pi 


the summation running over all ọ;. Thus, for a crystal with short- 
range order we have 


21— —iq0; 
UR = gay > 2 a 22 | AW? (q) [2e (1—c)a (pi) eH. (6.33) 


If œ (0;) does not depend on the orientation of p;, we may convert. 
the summation to an integral and then average over all angles between 
q and p; to obtain 


U$ = ce (1—6) È CiaiV ng (P:)- (6.34) 


Here C; is the coordination number of the ith coordination sphere 
and Vy, (p;) is given by 








= Q EE sin 90; z 
Vos (Pi) = a f dqq°F pns (4) Go, (6.35) 
where 
zx Q —e (q) 
F pws (= | AW? (a) PPE (6.36) 


In the case of no short-range order (the atoms of both species are dis- 
tributed completely at random over the lattice sites), all the a,’s 
are zero except for a», which is unity. The second band-energy term 
(which involves formfactor differences of the two components) may 
then be written in the q-sum representation as 


Q ’ — 
Ut? chaotic = oR > ee {ya — c) | AW? (q) l2} . (6.37) 
q 


The band energy of a completely disordered alloy thus takes the form 
4— — 
Ubs, chaotic = an xD ee 2 {| we (q) |2 bag, 


ECT | awe(q) |2}. (6.38) 


{Oo 
lo 
= 
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Allowing for short-range order we find instead 
Q , {— — 
Uysch= gr D Poe [W (a) |? 8,2, 
q 
1— -i c 
HLZ X) a (p1) AW? (q) 1207}, (6.39) 
Pi 


Thus in the absence of long-range order, the energy characteristics of 
a crystal will be estimated either by (6.38) or (6.39), depending on 
whether it has short-range order or not. Although short-range order 
is ubiquitous [412] and, indeed, alloys without it are the exception, 
most theoretical studies have until now been based on (6.38). This 
was due to a now out-of-date underestimate of the short-range effects 
and to a lack of an adequate theory which would allow us to incorpor- 
ate them correctly into the pseudopotential calculations. Note also 
the extra numerical difficulties arising from the inclusion of these 
effects. 

We now wish to obtain the Ups for an alloy with long-range order. 
The above discussion suggests that the task is actually to find the 
Hè. (r) for this case. If the atoms of an alloy are arranged in a com- 
pletely ordered manner, a summation over a crystal is conveniently 
subdivided into (i) sums over the sites v of the unit cell and (ii) 
sums over the centers j of the unit cells. We would then have 


1 
WaN) => D X (1 — ev) W8 (r — bys) + cv; W8 (r— tv;)} (6.40) 
j y 
or, in the q-space representation 


a : ~igty; , -igqty; 
1% (q)=+ Dd) Dae) Wa (qhe "4 + ewh a e 
j x 


(6.41) 


Since all unit cells are by definition identical, the factor e~*3' may 
be taken out of the sum (t,; = ty + t;) giving 


wea =a {oY} D (ey) WH (a) en eW (qe. 


(6.42) 
The first sum here is clearly 
ar ye = Seay (6.43) 


j 
where NV’ is the number of unit cells in the crystal. This actually 
means that (6.43) is equivalent to (6.9), except that instead of sum- 
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ming over the atoms the sum is over the centers of (the) unit cells. 
The v-summation yields the “pseudopotential” structure amplitude 
of the unit cell, which is identical in form to an analogous quantity 
involved in X-ray studies. In the present case, however, it is electron 
“rays” that are scattered and the role of X-ray atomic factors is taken 
by the pseudopotential formfactors. 

Thus, for crystals having a Cu,Au-type cubic cell we obtain 


We (q) = + {W} (q) + WS (q) [e77 +H) 
en Mi (Het As) 4 ezm (Ha+H:)]y, (6.44) 


where n’ is the number of atoms in the unit cell. 

For reciprocal lattice sites with equal-parity vector components 
H,, H. H, (the basic reciprocal-lattice vectors g,,; are associated 
with these) we have 


WÈ (a) = {W3 (a) + 3W% (q)} = W° (0). (6.45) 


For sites whose H,, H,, Ha components are different in parity (the 
superstructure reciprocal vectors g,,,, are associated with these} 
we have 


We (a) = -y (W3 (0) — Wå (a))=-7 AW? (a). (6.46) 


As a result, the pseudopotential formfactor for such a crystal will 
be (per ion) 


, Wt (q) for equal-parity H,, Ha, Hs, 
c — 
Wer (q) = + Aw? (q) for unequal-parity H,, He, H3. (0.47) 
Similarly, for CuZn-type cubic crystals, we find 
R Wè (q) for Hi + H+ H;= 2n, 
Wc (a) = + AW” (q) for H,+-H,+H3;=2n-+1. (6.48) 


It should be noted that in a disordered state the Cu,Au-type super- 
structures change into FCC structures and the CuZn-type into BCC 
structures. 

Similar expressions can be obtained for other superstructures. 
Although the argument that led to these results is somewhat different 
from that at the beginning of this section, it is easy to show that equa- 
tions like (6.45) may equally well be derived from (6.27). 
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Thus the band-structure energy of ordered CuZn-type alloys has. 
the form 


l g 1—E (En.f) | Ws 
Ubs = -57 > Sn, ee |W? (gn.4) 1? 


n,f 


Q 2 1—€(8n,ss) | AWÈ (gn,ss))/? : 
Har Qs Sas Ey a o (0-49) 
n,sS 
It is now clear that analogous expressions can be derived for an or- 
dered alloy of any other type. ln our discussion below of specific al- 
loys, some of these expressions will also be required for partially 
ordered alloys. 

It is worth emphasizing once more that the formulae for the abso- 
lute square of the electron transition matrix element (between states. 
|k) and (k + q |) are analogous to the formulae for the intensity 
of X-ray (neutron or electron) scattering. The latter formulae may 
therefore sometimes be used to obtain the absolute square of the 
pseudopotential formfactor for various types of order (including 
partially long-range order, which we have not, for the lack of space, 
discussed). It should be remembered, however, that the q dependence: 
of the pseudopotential formfactor is entirely different, so that nu- 
merical estimates may not be sensibly transferred from X-ray (neu- 
tron or electron) scattering. 

To conclude this section, note that, strictly speaking, our results: 
are only justified for simple metals, either pure or in alloys. With 
the noble metals, the additional hybridization potential [308] makes. 
a total-energy pseudopotential calculation much more difficult (for 
details, see [313, 430-432]). The interesting point concerning the 
noble and transition metals is that both the interatomic potentials. 
of pure metals and the ordering potentials of their alloys not only 
exhibit Friedel-type oscillations but also contain terms with cosines: 


of 2} E4 instead of cosines of 2kp [433]. 


6.3. Electrostatic energy 


Following the pattern of the previous section, we start with deriv- 
ations of general expressions, and then we apply them first to pure 
crystals and then to alloys, first completely disordered, and then 
with short-range and long-range order. 

The electrostatic energy of a pure metal may be calculated by 
applying the Ewald procedure [413, 414] to a system of charges dis- 
tributed in a uniform compensatory field. In [415] this method was. 
extended to include alloys. Our discussion will closely follow the 
argument in [404]. 
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Let the sites of a crystal lattice be occupied (randomly or with 
some degree of short-range or long-range order) by positive point 
charges Z% and Z%, whose concentrations are, respectively, 1 —c 
and c. This array is embedded in a uniformly distributed negative 
charge of density Z*/Q = [(1 —c) Z4 + cZ pl [1 — c) Q, + cQ pli~. 
Clearly, this choice of charge density makes the crystal electrically 
neutral as a whole. The charge density produced by the positive point 
charges at a point r is 
o(r) = (1-0) ZA (ry) Fey Zh(e—ty)}. (6.50) 

Vv 


ry 


liere the functions Z% (r — t,) and Zš (r — t,) are delta-like (we 
are dealing with point charges), so that to do the summation each 
point charge should be given a (positive or negative) Gaussian “cap”. 
We first evaluate the potential @, produced by the combined action 
of the total uniform negative charge and of all the “positive” Gaus- 
sian caps, Ap,. This latter term is 


Ap, = X {0 — e) Z4 oT w erp BE) (2)°. (6.51) 


y 


Taking the Fourier transform of Ao,, adding the Fourier transform 
of the uniform charge, and solving Poisson’s equation yield the po- 
tential duc to the total ane 


(r) = 22 5 eft $ {1 — e) Za eyZeye ty, (6.52) 


KY 


The potential ə, which arises from the positive point charges and 
negative Gaussian caps (these exactly cancel the positive caps) is 
also obtained by integrating Poisson’s equation. The result may be 
represented as 


_ 
P(t) = Dy (ley) ZA + ey zp e, (6.53) 
vV 


Since the Gaussian caps have no physical meaning, the total poten- 
tial p = Pı + P must be independent of n. The sum of the positive 
and negative Gaussian caps is zero, which makes it possible to choose 
the constant of integration as —nZ* (mQ), 

From electrodynamics the total cnergy of a charge distributed in 
the field ọ (r) is known to be 


Us = | o () 0) ar. (6.54) 
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By evaluating ọ (r), substituting the result into (6.50), and making 
some transformations we find (per ion) 


Jazz TA a qr exp (1) [Zso 


+ (24-25 1C (a) I? ] + DD E 
vey 
-+ (cy —e) Z* (Zä — Z$) + (cy —c) Z* (Z4 — ZË) 
1--erf (V 1 | ty—t,,|) 
| ty—t,, al 


-5 g} —2 L7}, (6.55) 


+ (Cy — c) (Cy — c) (Zå — Z5)"] 


This may be rewritten as 


Us=27 {= > qe (i) 1S0 
q 





1 1—erf(Vq|tyv—ty|) on E 
t+ — am IMM —Ĉ 
tN 2i [ | ty— ty | 42 2 2}} 
VV ÆN 
(Z4 — Z5) fán 1 q 2 
+E a S exp (— 4) 1Co 
1 1—erf (V4 | ty—ty, |) 
—ct— oe?) —— aIo Io 
+ N c( c) > (CyCy c ) I ty— ty | 


V, V'EN 


—2 y eao} (6.56) 


where the first term in curly brackets represents the contribution 
from the “average” crystal and the second is that from the charge fluc- 
tuations. 

By an appropriate choice of the Ewald parameter yn, the v and v’ 
sums can be made small and the electrostatic energy of the alloy 
takes the form 


Ues = ey {£3 Sep (-+) Is-52 3} 


+ SAE (A exp (E) ei 2-2 Beto}. 
i (6.57) 





15—01063 
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The pure-metal case is obtained from (6.57) by setting e=0. We 
have 


l yya f 4 41 2 ay 
Us => 2 ts 2 “qe eain] S (a) PF -2y 2}. 
q 
(6.58) 


The term in the curly brackets here is usually denoted as «,,/R,; 
its values for various structures are listed in Table. 6.1. It can be 
seen that the electrostatic energy favors the compactness of structure: 
it is greater in hexagonal structures other than the HCP and increases 
from close-packed cubic FCC and BCC structures to the simple cubic 
(Qes = —1.76012) and the diamond-type (œe, = —1.67100). The BCC 
structure should be the most favored by this criterion: its higher 
stability with respect to FCC and HCP is due to the effect of second 
nearest neighbors. 

The electrostatic energy of alloys will be evaluated using the 
| S (q) |? and |C (q) |? obtained earlier. By substituting them into 
(6.57) we find 

(i) for a completely ordered alloy 


Usora = 4 {= È ey oP (- St) | S (2,44) E -77 


n, fÍ 








—2 Vipa a a zoe (- Ehst) 1C (gns) 12 
-x= 2}, (6.59) 


(ii) for an alloy with short-range order 


Danm E r) 
n, f 





Q 


X19 (Bay) -E 2y 2) 
PEATE pe S exp (-) 
q 


x >; c(1—ce)a (p;)e 7 ®i— 2c (1 —e) VY 2}. (6.60) 
q 
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(iii) for a completely disordered alloy 


Ues, chaotic 


4 pp 40 cj 
= 2" = > 2 exp ( — 


Q n, Í 


n, f 
> L X} 
qg? ( 
q 








ght i 9 g S/n 
aa) |S (gn)? ~Ta7? V a 


- ) e(1—e) —2c(1—c) V2}. 


(6.64) 


(Z% — 2%)? 
p 





{ NQ ° 


It has been shown [415] that the second su'n here is zero. 


6.4. The total internal energy of an alloy: 
second-order perturbation theory 
and the locality approximation 


The configurational part of the total internal energy (i.e., the sum: 
of the band and electrostatic contributions) determines a structure's: 
stability at low temperatures (in principle, at the absolute zero if 
zero-point energy is ignored). We did not think it appropriate to- 
discuss here the mechanical destabilization of lattice due to shear 
effects (for more details see [416-417]). 

Expressions for the configurational part of the internal energy 
are given below (i) for crystals with long-range order, (ii) for crystals 
without long-range but with short-range order and (iii) for a com- 
pletely disordered alloy. 


6.4.1. An alloy with perfect long-range order. From (6.26), (6.49) 
and (6.59), ths configurational part of the total energy is 


1 {zpr án 4 Eh, f n 
Ucont= -5 |Z [> ey (-a) |S (Sn) PT 
n, f ; . 


2 2 +e Tn 4 hay eens |S En. 1) [2 (Bn, 2) 12} 





Haaga ae x (— FEE) |C lena) P 


hss En, ss 
— — n —E(En,ss 
2c ( 1 c) V2 + | Tr = P En, ss = ae ae l C (En,ss) |7 


x | AW? (gn,s3)12}. (6-62) 


15# 


228 Part 2. Pseudopotential theory in crystal-structure calculations 


‘The first term in the curly brackets represents the contribution due 
to the average crystal; the second is due to fluctuations. The essential 
point to note is that, unlike the pure-crystal case, terms with rela- 
tively small wave vectors are important in (6.62) and the fluctuation 
contribution may happen to be sizeable. It is for this reason that 
ordered alloys received a great deal of attention even in the earliest 
works on the electronic theory of metallic alloys. 

The above expressions may be helpful for studying the stability 
of ordered alloys. They may be used, for example, for obtaining the 
thermodynamic potentials of neighboring phases (with the subsequent 
determination of the points where these potentials are equal) or for 
calculating the configurational energies of different phases. An inter- 
esting study would be to combine alloy pseudopotential statistical-dy- 
namical theory with the method of statistical concentration waves. 

As is discussed in [401] the type of the ordering that occurs below 
a certain (“ordering”) temperature T, is controlled by the symmetry 
of the Fourier transform of the ordering potential. The ordering 
temperature and the depth of the minimum of the Fourier transform 
are related by 


T= —(í —c) cW (Snes) (6.63) 


so that when determining T, and the type of ordering, or when find- 
ing the existence range of an ordered phase we need not calculate 
the total energy of the alloy; it is enough to find the Fourier trans- 
form of the ordering energy in the vicinity of its minimum. Certain- 
ly, the position of the minimum has also to be calculated. In other 
words, we must calculate the reciprocal-space distribution for the 
Fourier transform of the ordering potential, locate minima, deter- 
mine their symmetry and depth, and this may yield enough informa- 
tion for finding the stable ordered phases, their types, existence 
Tanges, transition temperatures, etc. 

The following method for obtaining the ordering potential’s Fou- 
rier transform is taken from [401]. We begin with an analysis of the 
band energy (cf. (6.26)) as given by 





Up=% re 12) | awe (6.64) 


e* (q) 


It is known that an arbitrary wave vector q may be represented as a 
sum of the closest reciprocal lattice vector g, plus the difference be- 
tween the q and gy? 


q=g,+Aq. (6.65) 


‘The q summation in (6.26) may then be divided into the sum over 
Aq within the first Brillouin zone and the sum over the reciprocal 
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lattice vectors gn: 


u= Š 2 2 | $n + Aq |? [ AW” (gn +40) |? 


48 (En FOO 1 C (gn + Aq) |% (6.66) 


X $F (En + Aq) 
The function | C (q) P possesses translational symmetry so that 
| C (a) |? =| € (Aq) |’, (6.67) 


Denoting the reciprocal lattice sum in (6.66) a 


ntA 
Vis (Aq) = e w {2 | gn +Aq |? [AW (gn + Aq 2+4 eee 


4— 
— D IAW (o) rae SP}, (6.68) 
q 


where the last sum results from dropping the primes in the first, the 
fluctuational contribution to the hand energy takes the form 


Uy = => Vrs (Aq) | C (Aq) |? (6.69) 
Aq 


Clearly, Vps (Aq) is the Fourier transform for the band contribution. 
to the ordering potential (in the first Brillouin zone). 

The electrostatic contribution to the total energy should be treat- 
ed in a similar manner. The starting point this time is (6.57). By 
the same argument as used for the band energy we find 


Ue = + (24-2 {= D Doenw(-L)ic@e 
Ag n 
—20(1—e) 1} =£ 5 1C (4q) |? Ve (40), (6.70) 
Aq 


where the Fourier component V,, (Aq) of the electrostatic energy is. 
given by 





Ves (Aq) == {a TE exp ( — fat Sal” aM P ) 
SS ep (E) 6.74) 
q 


The total configurational energy takes the form 


Vo=. >) | C (Aq) |2V (Aq), (6.72) 


4q 
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where 

V (Aq) = Vys (Ag) + Ves (Aq) (6.73) 
is the Fourier component for the ordering potential. By calculating 
V (Aq) for the most important points (directions) within the first 
Brillouin zone (or, if necessary, over the whole of it), it is always 
possible to locate the minima of V (Aq) and to evaluate them for 
vectors which correspond to superstructural sites of the reciprocal 
lattice. We note that the ordering potential itself 


V (p:)= Vaa (9:) + Ve (93) — 2V 438 (03) (6.74) 
will take the form 


2 ” ido; 
Vio) =F >) Fp(ave* (6.79) 
q 
or 
Vv ( _ Q | do o2 F sin qi 6 76 
p) = z |) UPF ol) a (6.76) 
where 


Q 1— 
Fy (Q= S| AW" (a) PAS 


27 gy oon q? ~ > er 
HE AZS exp (ir) = Frs (OH Felg). (6.77) 
It is an easy matter to show that V(p;)may be written as 


Q ingo; (Q 1—e (q) 
v o) = | dg @ hee fF Se | Wa (a) | 


27t q? 2Q sin go; f Q 1—e (q) qb 
LE ge — 1 —.- 2 qPi 2 q 
E ZB exp ( án )} m1 f dq q qi { Ba q E* (q) Wa (g) 


x Wh (q) +e Z4Zb exp ( — £) } 


2 2 Singo: fR 2 1-8) | iyt 
+= | dg ð; [aa sga | Vala) l 


2m 52 ff q? 
+ Za exp (— iy )} . (6.78) 


Each integral in (6.78) corresponds to the total effective inter- 
atomic potential and depends on the interatomic separation in a 
quasi-oscillatory manner. For nontransition metals, the behavior of 
these potentials corresponds to Friedel oscillations. Clearly, the sum 
of these integrals (i.e., the ordering potential) will behave in a 
similar manner. 

Expressions like (6.73) have also been derived for alloys with 
complex structure (see [419, 420]). 
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6.4.2. Alloys with short-range order. Using (6.33) and (6.60), the 
configurational energy of such alloys may be written as 





1 (Fae An 1 - Sn. f Jt 

Peotone = 2 (LED gz ese (aie) 18 Goud P35 
1], B WV ge 1-8 ns) = 

-2y 2}4+25 gn, j gt |S (En) (2 (7? En) 12} 





f 
+4 faze [4 3 Lexp(—£) 16 (@) eae / 2] 


+e PASE caw aj}. (6.79) 


q? 
án 


The first part of this expression again corresponds to the average 
crystal; if the short-range order leaves the average atomic separation 
unchanged, this term will have no effect on the formation of short- 
range order. The effect of the interatomic separation and average- 
crystal energy on short-range order was first discussed in [403]. 

The second term in (6.79) represents the alloy energy associated 
with short-range order. If œ (pọ;) is independent of the orientation of 
0;, it may be conveniently written as 


Ucort.sher, =€ (1 — c) > CiaiV (pi) (6.80) 


where V (p;) is given by (6.76). 

It should be noted that to use these results requires a knowledge 
not only of the ordering potentials (and this for as many coordination 
spheres as possible) but also of the short-range-order parameters them- 
selves. For this reason, these results are in fact unsuitable for pre- 
dicting short-range-order and may only be helpful when estimating 
it in the (very rare) situations when independent considerations 
indicate that one of the spheres will predominate. 

A somewhat different and more promising way to employ these 
expressions was suggested in [404, 421-423]. Since the equilibrium 
short-range order corresponds to the minimum of the free energy 
{more generally, of a thermodynamic potential), it follows that to 
get the equilibrium short-range-order parameters we must evaluate 
the free energy (using the above expressions for the configurational 
energy) and then minimize it with respect to the parameters. 
Assuming the latter to be independent we obtain 


Toa =tio) {exp (262) — 1). (6.81) 


This formula gives the short-range-order parameter for the first 
coordination sphere, provided it contributes more heavily than 
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the other spheres. Although this approach is still used in interpreting 
experimental data [412] it is not strict enough. It is more consistent 
to obtain the V (p;) from (6.75)-(6.76) and then to improve them 
using the statistical theory of short-range-order. This gives [418, 424]: 


o —1qQ0; 
j= HESS ye 
a (:) = | Ey re (6.82) 


p0 





It can be seen from this that this method involves the summation 
over the unit cell of reciprocal space. Accordingly, Ueconf.sh-r. 
and @ (p;) will contain small g terms whose contribution to the band 
and electrostatic energies will require a careful analysis. 


6.4.3. Totally random alloys. The configurational energy may be 
found from (6.79) or earlier expressions to be 


U cont. chaotic 


-4 [7 ES D a eop (— BL) 15 Enp) l 











y Ont 
-any 2]+é 2 AEP 1S Ens) 2 | Wen PF} 
1— , 
+5 {yell -03 q 2 | AW (a) Pe}. (6.83) 


Both the average crystal and the fluctuational contributions may 
be important. The fluctuations are important because at small q 
the pseudopotential matrix elements are large. When calculating 
the configurational energy for a disordered alloy, either with or 
without short-range order, it is always necessary to verify the correct- 
ness of the fluctuation term. 


6.5. Higher-order perturbation analysis 


Important contributions to pseudopotential alloy theory came 
from [409] and [410], which examined alloys with long-range order 
and short-range order, respectively. In [410], an inclusion of higher- 
order perturbation terms was attempted. Since second-order pertur- 
bation theory corresponds to pairwise interaction, this is equivalent 
to an incorporation of multi-ion effects. According to Krasko and 
Makhnovetsky’s argument [409], a second-order approximation is 
only sufficient if 


W (qEr <1. (6.84) 
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The appearance of an ordered phase gives rise to superstructural re-- 
flexes of the reciprocal lattice in the region of relatively small q 
where the pseudopotential formfactors (and their differences) need. 
not be small. It has been estimated that, typically, 


i j oc | "in 2) | (6.85) 


Consequcutly, while the average formfactor of components may be: 
calculated to the second order for the smallest g,,; the difference form- 
factor which corresponds to the superstructure site g,,,, requires the- 
inclusion of the third and fourth orders (i.e., the three- and four- 
particle interactions). Working along the lines of [425], Krasko and 
Makhnovetsky [409] were able to show that the inclusion of the third 
and fourth order terms is in fact equivalent to the replacement of 
the ordinary potential by an effective k-dependent one, 


~ ” W (En) W (By —8n) 
W (gask)=W (gn) + 2) i or 


nen’ k+En , 





W (8,)W (8-8 ee En) 
+3 > EA u JE 5> (6.86): 
nAn’ nen. kter) Ek Ek+ew 
where g, are sites (both structural and superstructural) for which 
(6.84) is not satisfied, and £} is the free-electron energy. Krasko and 
Makhnovetsky point out that to go beyond the third order requires- 
consideration of the nonsphericity of the Fermi surface and they sug- 
gest a way of doing this. After transforming to the effective pseudo- 


potential w (gn. k) the energy Ep takes the form 


n )W ns k 
Ex = EX + 2 W eW en K (8.87) 


k Ekte, 


The total band energy of an ordered alloy may be written as 


Ubs = = UR + 2, AU) En, z) + > AUy, (En, ss)» (6.88): 


where Uj? is the second-order band energy (earlier, Ups) and 
AU gs (En, P and AUps (£n,ss) are the third- and fourth-order correc- 
tions due to the structural and superstructural sites for which (6. 84) 
fails. The pairwise contribution to the crystal energy is regarded in 
[409] as the “metallic” part of the cohesion energy; this is a sum of 
the band and electrostatic contributions: 


U pot == Ueg U2. (6.89) 
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‘The higher-order corrections characterize the covalent part of the 
cohesion energy. The application of this theory (in combination with 
the statistical theory of [401]) to the analysis of phase stability re- 
‘quired the Fourier transform of the ordering potential V (9) for a 
number of characteristic points in the first Brillouin zone. This is a 
sum of “metallic” and “covalent” parts 


V (Sn, ss) =V (Sn, ss) HVY (En, ss) (6.90) 


‘where V™et (g,,,,) describes the pairwise interaction, discussed above, 


Vg Q we (En,) 
v” (k= >) TA me D (gno ktg) —4 D P (En a)} 
ny q 


(6.91) 


(the last sum in the curly bracket arises because the VY (R = 0) 
term was dropped in the configurational energy) and 


AW? AW — 
D (qu, gy) = A B gi fe (a) — 1] [2e (q1) + 11 7 (ae) 


+ qg; [e (q2) — 1] [2e (q2) + 11 Z (q4) + Z (q2 — qa)l}, (6.92) 





where 
T(k, gn) = (Ek — Ek+s,) t 
I (k, En, gu) = (EL — Ehra, (Ek— Ektgn) t (6.93) 
The third-order short-range-order energy was evaluated in [410, 


-426] using the formalism developed in [276]. The total energy of a 
crystal in mth-order perturbation theory may be written [276] as 


UJ =UO UDUR... HUM (6.94) 
For the third order, 
I b b b 
UR —9 u ehk _ 
bs 2i E* (q1) &* (q2) &* (Gy) 


qi’ Qs) qs 


x A® (qi, Gey G3) S (41) S (qo) S (q3) A (qi + q2 + qs). (6.95) 


Here A (qı + qa + q3) secures the conservation of momentum and 
A? (qi, q2, q3) is the irreducible three-pole for a circular diagram 
with three external-field “tails”. For the present case 


1 for q,+42+43= 0, 


Alta ta) | 0 for q,-+4>+43%0, (6.96) 
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and 
2 *)2 
N (q. G+ gn gn) = -5 LE lag — (ggn)? 
IEn tg l 2kp q | qg 2ke+1 +n | 
x4 ats) — en LET ie SL 
gn la+ gn | n 2kp—q JEn In| eae 
1—DA = 
tagn , | keg ln ppa’ Ino ke - 
> —D , (6.97) 
q14+8n | 2kp—q 
_2arctan DA, qp<kp_ 
where m* is the effective mass, 
k 2 1 |q+8n | 
D=1/ (#£\— — í _aeniat+gnt 
} ( qR ) L =a Veh — En)? 
A= Ign |G+SEn | 
(2k p[i i q+8n £] » mopar tan DADO 
F 2 (kp)? 


Using the crystal potential as given by (6.5) and (6.6) and taking 
the Fourier transform of the set of occupation numbers c,, the third- 
order band-structure energy is 


UY — O > AC) (di, q2, G3) 
e* (qi) &* (qe) &* (q3) 
Ti Tg q 


x {W® (q1) W? (qo) W® (qs) S (q4) S (q2) S (Gs) 
+ 3W® (q1) W® (qr) AW” (qs) S (qu) S (Go) C (da) 


+3AW” (qx) AW? (q2) W? (qs) C (q1) C (a2) S (43) 
+ AW (qi) AW” (q2) AW” (qs) C (qx) C (qa) C (d3) } A (qi + q2 + Gs) (6.98) 


where the “difference” and the average crystal terms are separated. 

It can be shown that the first term in the curly brackets corresponds 
to the average crystal contribution and gives only a small addition 
to it. 

The second term is proportional to S (qı) S (qa) C (q3) and is zero 
for the following reason. If A (qi + qa +43) = 1, then qı + q2 + 
q; = 0 and since q) and gq, are reciprocal lattice vectors, so too 
is q}, and S (q,) S (qa) C (q) will therefore vanish because C (q; = 
gn) = 90. If qv +q +q; #0 then A(q, + q, +9) =0 so 
that the second term in (6.98) is again zero. 

The third term in (6.98) corresponds to the double scattering of 
electrons on ion pairs and must add something to the pairwise inter- 
action energy. 
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The last term is proportional to triple correlations in solid alloys; 
since the probability of such correlations is very small] this term should 
be negligible. 

As a result, the short-range-order contribution to the energy of 
a solid solution will be (to third order) 





tg ~ igo; b x (a) 
Ubs, sner. = WV c (1 c) 2al; 2 { IAW (q or (q) 
2n AZ 


+p 


’ b 7 A3 (q1, 4+8n; En) 
x [2 AW" a+ en) Wn) cepa ee Gey It? (6-99) 


exp ( — É) + 32A W? (q) 


n 


the ordering potential including, and the electrostatic contribution 
takes the form 








= 2nAZ*? exp (—£) 
— L an 
viej= & || St 
Q ,1— oe 
+e? a IAW? (q)|? +2 (o) | Sg dg, (6.100) 
where 


AW? (g) Sy’ W2 (gn) AWO(| q+, 1) 
T (q) = 30 F) 2 ¥en) e 1 qn I) 


x AO (gq, [a+ gnl, gn], (6.101) 


the summation running over the entire reciprocal lattice except for 
the vector g, = Q. It can be seen that the third-order term gives rise 
to the term T (q) which may affect the ordering potential. The third- 
order contribution may thus influence both the ordering energy and 
the short-range-order contribution to the alloy energy. 





6.6. OPW nonlocal alloy theory 


The above results were obtained using local model potentials. 
It would be desirable to rederive them using true pseudopotentials, 
based on the orthogonalized plane wave approximation (the OPW 
or first-principle pseudopotential). 

Pseudopotential alloy theory was pioneered by Hayes et al. [398] 
who formulated the OPW nonlocal alloy theory and calculated the 
ordering parameters of Li-Mg alloys. It is only natural, however, 
that most of the papers which followed were based on local model 
potentials, as these make numerical work easier and simplify the 


Ch. 6. Formalism of crystal-structure energy calculations 237 


interpretation of results. These studies yielded a number of useful 
results some of which were outlined above. Quantitatively the agree- 
ment with experiment was not entirely satisfactory and the most 
natural way to remedy this was, of course, to question the assumption 
of locality. This gave rise to extensive work on nonlocal OPW pseu- 
dopotentials and their application to alloys. An important contri- 
bution came from Hafner [427] and [428]; and the following discus- 
sion of OPW alloy theory is based, in fact, on [428]. The starting point 
of the theory is the standard Schrodinger equation with a potential 


Y(r) = >) Y, (rcr) + DY Ya(r—ry) (6.102) 
ÑA) iB) 


which is to be determined self-consistently later. 
For both types of core state, the orthogonalized plane wave of qua- 
si-momentum k (OPW,,) has the form 


{OPWe)=k— >) È |ry, At) (ry, At|k) 
XA) t 
3 s 


where k is related to the normalized plane wave, P4 and Pp are 
projection operators and the bra- and ket-vectors represent ion-core 
states centered on the r; sites, 


(r|ry At) =} (r—ry), (r|rj, Bs) = Y? (r—r)). (6.104) 


By expressing the wave function Yk in terms of the generalized OPW 
(6.103) we have 


Wy) =D) ar (qQ)|OPWesq)=(1—Pa—Ps) Px, (6.105) 
q 
where 
Dy = X a (q)|k +4) (6.106) 
q 


is a pseudowave function. By substituting the above relationships 
into the Schrédinger equation, we find 
(T+W)|Dy) = Ey |®,), (6.107) 
where the pseudopotential W is given by} 
W =Y (1) + (Ee — H) (Pat Po). (6.108) 


By applying the Hamiltonian H = T + Y to the core states we ob- 
tain the Phillipps-Kleinman pseudopotential generalized to the case 
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of an alloy, 


W=Y(r)-+ > X (Ex E ry, At) (rj, At] 
HA) t 


+S) X (Ek E?) Irj Bs) (rj, Bs]. (6.409) 


-i mm 


HB) s 
By generalizing the optimization procedure of [74] for an alloy and 
linearizing the equation we find 


UW =(1—Pa— Pp) Y + (k | ((—Pa4— Psp) Y |k) 


“kapa Pp iky (Pat Pa). (6.110) 


Applying the operator Y == H — T to the projection operators 
yields the alloy generalization of the optimized Harrison pseudopo- 
tential; 


W\ky=Y|k) -4 SN (A24 Ok | |) — EP) |r;, At) j Atk) 
JA) t 
+3 S (kyky jk) — E?)\r;, Bs) rj, Bs|ky. (6.141) 
iB) s 


We are now in a position to evaluate the diagonal and off-diagonal 
matrix elements of the pseudopotential; they may be expressed in 
terms of single-ion pseudopotentials. By making use of partial struc- 
ture factors, the matrix elements may be written as 

(i) off-diagonal: 


k +a |k) = S4 (q) (k-+ q| W4|k) + Sg (a) (k +a Welk), (6.112) 
where 


(k-+q|W,|k)=Y, CER (k24 (k |W [k) 


— Ef) (k+q|0, At) (0, At|k), 
(k+ al Walk) = Ys (a) +È (K+ |b) 
—E®)(k-+q|0, Bs) (0, Bs|k). (6.1413) 
(ii) diagonal: 
(KIW |k) = (1— c) (k| Wa |k) + (k| Walk), (6.114) 
where 
(kWa) = Y4 (0) +(1—(k| P|) 
x È (M+ Y— EP) (0, At|k)l?, (6.145) 
(k|W palk) = Yn (0) + (1—(k| P| ky) 
x © (k?+Y — B?)|(0, Bs|k)|?, 


fi 
s 
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where we have used the notation 


Y =(1— c) Y 4 (0) +e¥s (0), 


(k|P|k) = (1—6) $| &]0, 40+ eX kjo, Bs)j2, 4O) 
t s 


The important qualitative conclusion to be drawn from these re- 
sults is as follows [428]. Since the optimized formfactor of either com- 
ponent depends on the other component, it differs from the pure- 
component formfactor and this difference cannot be reduced to a 
mere change in the specific atomic volume or in the density of va- 
lence electrons. 

Since the self-consistent crystal potential is unknown, we start 
with electron-ion potentials which are substituted into the above 
formulae in place of W, (W p). This is followed by a linear screening 


procedure using a uniform electron gas of density p = ZQ-! (the 


atomic volume Q should be found from a thermodynamic stability 
condition). As a result, the matrix element of the self-consistent po- 
tential takes the form 


(k+-q|Wa, slk) =(k+q|W%, alk) + Wan (q), (6.117) 


where 
ser A (1—f ( < (k+qa1W% plk) 
WE n(o = Seo ee | -eker k, (6.118): 
IRI<R p 
e* (q) =414+[1— ¥ (4) le (0)— 1). (6.119): 


Here the function f (q) takes into account the exchange and correla- 
tion corrections (in [428] these are given in the form obtained in [429]). 
The electron-ion potential used in [428] is evaluated by the Y, meth- 
od, as was the case for pure metals. The important features of this. 
process are the following. 

First, only one free parameter is employed to describe many- 
electron effects. Second, the normalized OPW density is made up of 
a constant term Qt (1 — <k | P |k))~', and site-centered localized 
terms 


N-(k|P|k) (1 —(k|P|k))-! and N- (k} Py k) (1 — (k| P[k)“ 


which describe orthogonalization holes. Taken together with ion 
charges they give effective valencies 


4 (k | Pa) Lk) 
Zam=Zantay D PIE (6.120) 
IKI<A p 
different from the valencies of pure metals. Third, the theory makes 
it possible to evaluate the energy eigenvalues of the ion-core 
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electrons: 

=> 18 

FEAP = —-B(F) Arr eA . (6.121) 
"There Y? and Y* are the averaged (at g = 0) matrix elements of 
the Coulomb potential (for conduction and core electrons) and the 
exchange potential respectively; ex? are the ion eigenvalues, 
y°PW is the orthogonalization hole. It should be stressed that 
these relations are only true for completely disordered alloys; other- 
wise the effect of nearest neighbors on the core states must be con- 
sidered. 

The pseudopotentials obtained were used for evaluating the total 

energy which, as in the local case, contains the free-electron, band 
and electrostatic contributions. The free-electron energy is 


U = — 

















2 Zii bt set 2 (Ye +Y”) 
a aa A yopw | (k | 0, At) |? 
£ p__.1 yOPW) _|(k 10, Bs) |? 
E 5 (epe +3 my éa sap , (6.122) 
F S 
and for the band energy we have 
Urs= 2 ISDA (+2 IC (q) [2A F (q), (6.123) 
‘where 
F (9) — 22 i(k-+q |W Ik)? 2 IW Co) |? 
O= ah W= Ikta]? a aa ? (6-124) 
F 
FIn 2 I(k--q | AW | k) |? q? | AWscr (q) |? 
OF ear | Tecka ter tpi + Ot 
KIS? p 


It is worth noting that (6.123) is similar to the corresponding local- 
potential results. Eqs. (6.123) and (6.425) differ from the analogous 
local-theory expressions in that the local potential is replaced by 
a nonlocal one. It follows from (6.124) that for a completely disor- 
dered alloy 


5 5s 
An = gyre (1—0) \ AF (q) dq. (6.126) 


The relations derived in this section are particularly valuable in 
that they are rigorous and may therefore form a basis for more 
consistent and fundamental studies. It should also be noted that 
the obvious possibility of changing to local theory expressions 
{merely by replacing nonlocal by local potentials) justifies the 
earlier efforts in the field. 


Chapter 7 


Pseudopotential theory of alloys. 
Structure stability application 


7.1. Phase boundaries in terms 
of pseudopotential theory 


There are two major questions which an alloy stability theory 
must answer, namely, which phases can exist in a given alloy and 
when. One of the earliest attempts to attack this subject in terms of 
pseudopotential theory was undertaken by Stroud and Ashcroft [434]. 
As no reasonable theory can do without an explanation of the Hume- 
Rothery rules (see Chapter 6), it was on this point that they focused 
their attention. 

The internal configurational energy is represented in [434] in the 
usual way as the sum of the electrostatic and band contributions. 
The former is written (in ryd per electron) as 


1 X(ZyZy) 


= —— -—— 7A 
INZ A [ty—t,[’ (7.1) 
VV 


es 


where the brackets denote an average over all possible configurations 
of the A and B ions. As these are assumed to be distributed at com- 
plete random, we get (Z.Z) = Z°, and in the Fourier representation 
we have 


1 BaZ aro l 
Ues =- Linge NIS (0) bu, s, — 1) (7.2) 


q 





where S’ (q) is the Röntgen structure factor divided by the number 
of atoms in the unit cell. 
The band energy term was written as 
gh ern r 4 

Um = Boy MPS (ed? aat] (7.3) 
From (7.2) and (7.3), the relative energies of the FCC, BCC and the 
ideal c/a ratio HCP structures of the alloy systems Cu-Al, Li-Mg 
and Cu-Zn were calculated using the pseudopotentials W? = 
81Z/g? and W? = (8nZ/g?) cos gir, (whore r, is the core radius 
and is fitted to experimental data). There are a number of approxi- 
mations which are made in the above expressions and which were 
quite natural when the study was made. The virtual crystal approxi- 
mation is used both in (7.2) and (7.3); there is no Ewald exponential 
in the sum over q in (7.2); and the exchange-correlation corrections 


16—01933 
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are omitted in (7.3). Even so, interesting qualitative results were 
obtained. 

The major result is clearly illustrated in Fig. 7.1. As the radius 
of the Fermi sphere becomes equal to half a reciprocal lattice vector, 
the energy of the corresponding structure decreases sharply relative 
to the other two. In other words, a fall in the energy of a crystal 
structure occurs when the Fermi sphere touches the Brillouin zone’s 
face. The effect was observed in transitions between various pairs 


Z 


AU, 1073 Ry/el 





0.0 0.2 0.4 0.6 0.8 1.0 


Fig. 7.4. Energy differences between the HCP and FCC structures (bold line), 

and BCC and FCC structures (dashed line) for the Cu-Al alloys. The arrows indi- 

cate the g, for the HCP (1), FCC (2), and BCC (3) phases. Figures in parentheses 
are structure weights. 


of phases and could not therefore be dismissed as fortuitous. The 
reason for this effect is, presumably, the discontinuity occurring in 
the third derivative of the energy with respect to electron concen- 
tration at g, œ~ 2kp. This discontinuity causes a precipitous drop in 
the energy of the corresponding phase and hence changes the sign of 
the energy difference between the phases. For elaboration of these 
views see [419]. 

Hence, when the Fermi sphere comes in contact with the face of the 
first Brillouin zone, a phase transition takes place. Though this 
observation is an agreement with earlier views, the underlying rea- 
son for the effect is somewhat different from the one assumed earlier 
to be the case because Uy, has no minimum at g, œ 2kp. Although 
the sums involved in total energy calculations were carried in [434] 
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over about 160 reciprocal lattice vectors, it is interesting that each 
sum is dominated by a vector which corresponds to the contact 
between the Brillouin zone and the Fermi sphere. Jt should be noted 
that although Stroud and Ashcroft were rather successful in explain- 
ing the Hume-Rothery rules, their mathematical formalism is. 
not entirely rigorous. 

Interestingly, Stroud and Ashcroft’s conclusions were supported 
by Kogachi and Matsuo who published their results [435] at about. 


0% Nig 60% Mg 90% Mg 
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Fig. 7.2. Positions for the gọ and g, for the FCC (1), BCC (2), and HCP (3) 
phases of the In-Mg alloys. 


the same time. In the latter pseudopotential alloy theory was applied’ 
to an investigation of the phase boundaries and structure stability 
of the alloy systems In-Mg and Al-Mg. Since all the three elements. 
are not transition metals it might be expected from the start that 
the results of [435] should be more accurate than those of [434], in 
which the alloys studied were mainly noble-metal ones. Kogachi 
and Matsuo, however, not only dropped the fluctuational term but 
also neglected the difference between the electrostatic energies of 
neighboring phases (assuming the phases to be simple). In both [434]: 
and [435] the band energy does not include exchange-correlation cor-- 
rections in the dielectric permeability. Thus in [435] a binary alloy 
was in fact treated as a pure metal, the only “binary” effect beings 


a change of Z and Q with composition. As a result, the quantitative- 
16* 
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analysis more or less reduced to a determination of the relative posi 
tions of gy and the first reciprocal lattice vectors. 

Figures 7.2 and 7.3 illustrate the positions of gy, 2k, and the 
structural sites of the FCC, BCC and HCP structures (with their 
weights) for the alloy systems In-Mg and Al-Mg. It can be seen for 
the In-Mg system that in the absence of Mg, qo turns out to be close 
to the structural sites of close-packed structures, which explains the 
tetragonally distorted structure of In. When Mg is introduced the 


0% Meg 40% Mg 80% Mg 
0.74,08 09 1.0 07 08 0.9 1.0 07 08 09 1.0 
| l 


20% Mg 60% Mg 90% Mg 
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l 
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| 


| 
| 


} | l | | i l | l 
3 
Fig. 7.3. Positions of the qo and g, for the FCC (1), BCC (2), and HCP (3) 
phases of the Al-Mg alloys. 


value of q falls to lower q's whereas the reciprocal lattice sites are 
raised to higher ones. As a result, the degree of tetragonality decreases 
with Mg concentration and at a certain composition the appearance 
-of the FCC structure becomes possible. At about 80-90 at. %Mg, 
the (200) site of the FCC structure moves into the region q > 2kp 
thus giving rise to an HCP structure which is also typical of 
pure Me. 

For the Al-Mg system, the situation differs in that even for low 
concentrations of Mg, the values of g, are greater than qo in all the 
major structures. Accordingly, the low Mg concentration favors one 
of the close-packed structures, namely FCC, while HCP is only 
favored from 90 at.% Mg, at which gyo99) become greater than 2kp. 

The predicted behavior of phase transition correlates rather well 
with what is actually observed in these systems, though the com- 
plex structures are not describable in terms of the above theory. 
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The condition g,=2k, is thus the key factor to determine phase 
boundaries in this work [435]. Another important factor when 
determining phase structure and phase transition points is the posi- 
tion of g, in relation to qo 


7.2. Ordered phases, their structures, 
and existence conditions 


With the notable exception of the very first paper [398], all early 
pseudopotential calculations on ordered phases were performed to- 
second order in local pseudopotential. Even with this restriction, 
the results obtained were important enough to encourage further 
application of pseudopotential theory to the study of ordered alloys, 
intermediate phases, etc. [399, 436-438]. 

It was usually assumed that the atomic volumes and valencies of 
the components were the same and exchange-correlation effects were: 
neglected. Given these approximations, the band energy was written 
in the form 


Ups =- DIS (En) I71W (Bn) ?€ (Bn) x (En) 


+e D D, (a) eS, (a)—(1—0) Sa (a), (7-4) 
q 
where 


S4(q)= 2 exp [—igty4)], Sz (9) = 2 exp [— igty sy] 


VA 


are partial structure factors, and @, (q) = | AW (q) Pe (q) x (a) 
is the characteristic function of alloying. 

The first sum in (7.4) represents the energy of the average crystal 
and is associated with the arrangement of crystal lattice sites; the 
second term depends on the way the atoms of different sorts are 
distributed over the sites. For a completely random alloy, for 
example, the second sum is 


Uys, rand = e(1— c6) > D, (q). (7.9) 
q 


The effect of alloying is determined by the difference between the 
energy U,, as given by (7.4) and the averaged energy of the pure 
components, 


A = Uys, au — CU ss —(1—¢) Uĵs- (7.6) 
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Explicitly, 
A=N*S D, (a) eSa —(1— c) Spl? 
q 
HNSI [S (gn) |e (gn) X (En) L/W (Bn)? 


— (1—6)? |W 4 (gn) |? 07] Wp (Bn) I 
=U, — Ne (1—6) 2 |S (Gn) |7|AW (g7)1e (gn) x (8n). (7.7) 


Here the first term is the energy of ordering and the second is the 
amount by which the energy of the average crystal changes on alloy- 
ing. For a disordered alloy, 


A=+e(1—e) {3 Dla Dalen) 18) 
q n 


Figure 7.4 shows the characteristic alloying function ®, (q) for the 
Hg-Meg alloy. The positions of the (structural and superstructural) 
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Fig. 7.4. Characteristic alloy function Fig. 7.5. Alloying potential ®, (p) 
Da (q) for the Hg-Mg alloys. Below— for the Hg-Mg alloy. 
‘structural and superstructural sites of 

the BCC and BCT structures, 


reciprocal lattice sites and their respective weights are also shown. 
It can be seen that large magnitudes of ®, (q) correspond to the 
positions of superstructural sites. This leads to an increase in the 
ordering energy as a whole. 
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The configurational energy of an alloy may also be written in 
terms of the pairwise alloying potential 





Za— Zp)? Q , 
D, (p) = S25 28" +. | AW («)|%e (9) x (9) exp (ap) da, (7.9) 


pọ being the interatomic separation. 

Figure 7.5 shows the pairwise alloying potential Ð, (o) for HgMg, 
together with the number of like and unlike neighbors for the ordered 
structures HgMg and Hg,Mg,. It can be seen that the maximum 
in Ð, (0) falls in the region of p for which the number of unlike 
neighbors is increased on alloying. Clearly, this correlation between 
the maxima of @, (0) and interatomic separation makes the total 
energy decrease with ordering and is therefore a mechanism for the 
appearance of order. 

A comparative analysis of the ordering energy was carried out by 
inglesfield in his work on the (equiatomic) CdMg, HgMg and CdHg 
systems [437]. In calculating the matrix elements W (q) he used 
a spherically symmetric square-well pseudopotential of depth A 
and width Rm- The depth A was chosen so that the magnitude of 
the minimum in ®, (q) was equal to that found from a model poten- 
tial calculation [405]. The parameter Rm was taken to be the same 
(Rm = 2.6 a.u.) for all the systems. In this approximation, the 
value of A turned out to be proportional to A? while A itself was 
proportional to the difference in the Pauling electronegativities of 
the components. This has led Inglesfield [437] to the conclusion that 
the alloying and ordering effects are determined by A?, that is, by 
electronegativity differences between the components (provided, 
of course, that the alloy is essentially metallic). 

The ordering energy of the phases studied was found, for HgMg, 
CdMg, and CdHg, respectively, to be 3.8 x 10-3, 0.8 X 10-3, and 
0.5 X 10-3 a.u. per ion. In the HgMg system the ordering energy 
turned out to appreciably exceed the thermal energy at the melting 
temperature, indicating that an ordered state persists up to the 
melting point in this particular alloy. In CdMg the computed order- 
ing energy was found to be in fair agreement with the experimental 
ordering temperature. A comparison of results for CdMg and CdHg 
suggested that at about 300 K the CdHg system should go through 
an order-disorder phase transition. As a phase transition is indeed 
observed in this system at 260 Kk, it was reasonable to argue that it 
should be of the order-disorder type. 

In his structural analysis of ordered phases [438] Inglesfield pointed 
out that for spherically symmetric square wells the alloy energy may 
be written as 


U=U+A’A,, (7.10) 
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where U is the weighted average energy of pure components and 
Am = AIA?. 

If A? is very small, clearly the total energy will be determined 
by the energy of the pure components and the alloy structure will 
be very nearly that of the constituents: if A? is large, Inglesfield 
suggests, the resulting compound must be ionic. 

In the same series of papers [394, 436-438] there is an interesting 
attempt to estimate the range of existence of an ordered phase. 
This was done by considering the change in the ordering energy 
when Nô atoms of species A were replaced by the same number of B 
atoms arbitrarily distributed over the sites originaliy occupied by 
the A atoms. The energy difference for an ordered phase having 
(1/2 + 6) N of the B atoms and 1/2N atoms of the same sort is 


Ua, ord+ô Ua, ord ~~ 46 (U2, disord —" Ua, oral- (7.11) 
Consequently, 
U 
a =U,—U, +4[U., disora — Ua, ord]. (7.12) 


It follows from this that the higher the ordering energy, the strong- 
er the singularity in dU/de at c ~ 1/2 and hence the narrower the 
range of existence of the ordered phase. 

In [439-441], the above theory was extended to include (ordered) 
alloys with components of different valencies. 

The ordered phases considered in [439] consist of components 
whose ions have different charges and radii. The formulae for cal- 
culating the energy of the phases are basically similar to those 
given in the previous chapter. There are some differences though. 

For example, the band energy is written in [439] for an alloy with 
short-range and long-range order. This, in fact, is formula (6.39) in 
which the short-range order paramotor a (0;) is replaced by 


P (Pm) = 0 (Pm) + pene n0 (Ps (7.13) 


where œ (Ọm) is the short-range-order parameter and the second 
term describes the way atoms of different sorts are distributed over 
the a and p sublattices in the presence of long-range order of 
amount y (the atoms A and B are assumed to be distributed arbitrarily 
over the sites of either sublattice); a and Eg are the concentrations 
of the & and 6 sites; Om is a geometry factor determined by the 
structure of the ordered alloy. 

For the superstructure £1,, 


© (Om) = e2ims, (7.14) 
for £1, and £2), 
o (Om) => (e27imi | g2ctime 4 + e2mims) | (7.15) 
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In both expressions m,, M, and m, are integers or half-integers. If 
there is short-range order and some amount of long-range order in 
these structures, 6 > takes the form 


AUH — r > a tone IAW? (q)|2 





"agen, +e c) 2 Cio we b. (7.46) 


It should be kept in mind that instead of introducing the usual 
weight functions, Katada et. al. [440] sets Ogg = = 1/3 for q = 


£n,ss in the superstructure ¢1,. It is also worth noting that, by 
definition, @ (0) = 1 and ø (0) = 1 so that a) = 1— (Ëa Eqn2le(1— 
c)). 
The electrostatic energy is written in [440] as 
_ 23 . 
Ues = N 2 > ae {> >) Z (ty) Z (ty) exp (— iq, t, —ty-) 


q v v 


— D> Z(t} (7-17) 


where Z (t,) is the charge at point t,. By isolating the energy of 
the average crystal we find 


Ues =U es + AU es = >» F es (a) (5a, Enss w} 
q 








oo 1 singp; e(—ce) 
HD Fes (0) {Eaben*Sa, gp, s tae EO Ds Cres =, -EF 
q t 
(7.18): 
where 
. nm (Z? , 2a AZ? 
Fes (= a and Iel) F e 019) 


Thus, if we assume the volume to be constant (which is not always the 
case), the energy associated with the appearance of long-range order- 
may be represented as 


Vora = AU es + AU 5s = EaEpNnax > [Sys (q) 
q 


+ Ges (0) {ôa sn, 55— ar f- (7-20) 


Katada et al. point out that AU,,'is always negative and favors order 
in the arrangement of the atoms. At the same time, AU), is chiefly 
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determined by the behavior of ẸFps(q) at q’s smaller than the first 
zero of the function (Fps (q) < O in this region). 

In alloys for which Z4 = Zp, an ordered state may only occur if 
AUņs is negative because AU., = O in this case. If, however, Z4 =Æ 
Zp, the value of U ra may also be positive, provided that the 
positive contribution, AUy,, outweighs the negative one, AU,s. 
Even though the above expressions were derived by neglecting the 
Ewald term and the exchange correlation effects, they can reasonably 
ibe used for numerical estimates of the energies of ordered phases. 
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Fig. 7.6. The energies of the B’- and ĝ”-phases as functions of composition (the 
In-Mg alloy). 


Numerical results for In-Mg alloys are shown in Fig. 7.6 for 
the region where the ordered phases of InMg (£10) and InMg, (£1,) 
exist. The calculations were carried out with an optimized potential 


, 


taken from [154]; the sum t 5 Gys (q) was replaced by an integral. 


The results show that the ordered phase of the #1,-type structure is 
stable in the range 40-65 at.% Mg, that of the £1.-type in the range 
65-80 at.% Mg. In both cases the energy minimum corresponds to 
the stoichiometric compositions. 

If a completely ordered state is considered and the Gorskii-Bragg- 
Williams entropy contribution is assumed, then the transition tem- 
perature T, for the In-Mg alloy will be T, = — Ura (Kg In 2)? and 
for In-Mg, T,= —Uorq [Kg (2 In 2 — 0.75 ln 3]. The comput- 
ed temperatures will be, respectively, 1000 K and 1200 K whereas 
the observed ones are 600 K and 620 K. This discrepancy, Kogachi 
argues, cannot be due to a poorly chosen potential because a poten- 
tial taken from [405] only increased the predicted temperature by 
another 10%. This argument is not, of course, entirely satisfactory, 
but the true reason for the discrepancy remains unclear. 

In [440], the above theory was developed to include alloys with 
more than one atom in the unit cell. Specifically, HCP alloys were 
investigated. The expressions for the band and electrostatic energies 
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were found to be 


Uys = >» F vs (q) On, 
q 








i — sin 4p; 
AUys= >) Frs (q) finto + AE 2i C:a; a, \ , (7.21) 
and i 
' 1 
=>) F F es (q) {o, —+} ’ 
q 
Ave = By Se (D {tio — “SES + EAD Ce a}, (7.22) 
q 
where 
on =p {1 ++ wta) ô (d; gn)» (7.23) 
v(q) =exp | —iq (=. aya : az + -5 za) |. (7.24) 


The exact form of o’ depends on the type of the superstructure. 
For example, for #19 


, 4 4 5 X = 
o=- {145 vay a} ha, gn +4) Io, (7.25) 
for the superstructure DO 4g 


d'= -i (on +0?) oE a), (7.26) 


oP =A {1—5 aH a} (a ga HA) (7.27) 
wata inotraola gn HRE). (7.28) 


The ordering energy calculations for the Cd-Mg system are very 
interesting. For the compositions CdMg, and Cd,Mg the ordering 
energies were found to favor the DO, structure, and to favor #19 
for GdMg. These are indeed the structures observed in experiment. 
The phase stability boundaries calculated for these structures were 
also found to be in agreement with experiment. It is interesting to 
note that for the equiatomic CdMg system the results [440] are to 
within 20% of those in [437]. 

. Since ordering phenomena are characteristic of noble-metal alloys, 
it was natural to apply pseudopotential theory to this case. Alloys 
based on Ag, Au, Mg and Al were studied by Kogachi [441] using the 
pseudopotentials proposed in [343]. These potentials, however, 
contain additional repulsion terms due to the overlap of d states, so 
some corrections had to be introduced into the theory. 
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With a repulsion potential, the structure-dependent term of the 
total energy has the form 


U = Ung + Ues + Urp, (7.29) 


where the repulsion energy Urp may be written for the BCC and FCC 
structures as 


Ur = sar D Dy (Pa (t) Pa (ty) Waa (ti — tl) 


i ji 
+ Ps (ti) Pa (bj) Wap (\ti—tj]) — [Pa (ti) Pr (ty) 
+ Pp (ti) Pa (ty) Wan ([ti—tj])}. (7-30) 


Here the fluctuational and average crystal terms are easily separated. 
By combining them with known results for the band and electrosta- 
tic energies we find the energies of a completely disordered and an 
ordered crystal: 


Uenaot = {Fs (d) Sq, pt Fes (a) | [ Sa, tu av |} 


q 


tAr -CW (p:), (7.31) 


: l o ji 
Vora = Ucnaot + EatBnax ` [Sos (q) + Ges(Q)] o Bn, ss =| 
q 


+ >) mss AW (p;). (7.32) 


i 


Here &; max are the limiting values of the short-range order para- 
meters for the ith coordination sphere; W,, (9;), W BB (pi) and 
Wap (pi) are the repulsion potentials for, respectively, AA, BB 
and AB pairs (separated by 0;); and AW (p) = Waa + Wg, — 


2Way, W = (1 —c)?Wa, + EWag + 2c(1 —c) Way. It was as- 
sumed in calculating the repulsion potential that it remains unchanged 
on alloying, operates only between noble metal atoms, and is 
only substantial for the first four coordination spheres. 

Pure Mg and pure Al were predicted to be stable in, respectively, 
the HCP and FCC structures which are actually observed in these 
metals. When either Ag or Au is added, then at a certain concentra- 
tion of a minor component, the Ag-Mg or Au-Mg system becomes BCC 
and the Ag-Al system becomes HCP and so on. If, however, the 
concentration of the noble metal is increased further, the computed 
(energetically favorable) structures depart from those observed, which 
is consistent with [313]. Thus, although the pseudopotential in [8413] 
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is not good enough to predict the favorable structures in pure metals 
or noble metal-rich alloys, it is still useful for moderate noble metal 
concentrations. 

For ordered phases, only the energy calculations on the Ag-Mg sys- 
tem proved to be successful. It was found in this case that for £2,- 
alloys in the B’ plase, an ordered state is possible when the positive 
contribution from the band energy is outweighed by the negative 
contribution from the repulsion and electrostatic terms. 

It is pointed out in [441] that the perseverance of an ordered phase 
up to the melting temperature cannot be explained unless the repul- 
sion energy is taken into account. Even if it is, however, the results 
for the Au-Mg system disagree with experiment. The prevalence of 
the positive band-energy contribution makes the ordering energy 
positive and thereby precludes the existence of an ordered phase in 
this system. 

As was mentioned in Chapter 6, a promising and conceptually 
different way of using pseudopotentials in alloy theory in general 
and in ordering problems in particular is to invoke the statistical 
thermodynamic theory of atomic ordering. It is worthwhile here to 
discuss some of the applications of this approach. In [402] it was 
employed in a stability analysis of an ordered phase in an equiatomic 
Ca-Ba alloy. A model potential used in the calculations was taken 
from [442]. 

In the disordered phase, the lowest energy was found in the actually 
observed BCC structure. The next step was to calculate the function 
V (Aq) in the [100], [110], and [111] directions. This particular choice 
of directions was suggested by a statistical thermodynamical analy- 
sis, which showed that for the BCC structure the possible stars of 
superstructural wave vectors have minima of V (g,,,,) at the (100) 
or (111) points (CsCl-type superstructure), at the (1/2, 1/2, 1/2) 
point (NaTl-type superstructure), and at the (1/2, 1/2,0) point 
(their superstructure is similar to that arising in implantation 
phases). 

The numerical results represented in Fig. 7.7 predict for the Ca-Ba 
system a CsCl-type superstructure and 7, (as derived from the value 
of V (Zn,ss5)) of about 890 K. Experimentally the order-disorder tran- 
sition occurs at about 400 K in this system and the superstructure is 
rather of the NaTl type. 

We may thus conclude that the approach suggested in [401, 402] 
can also be used to interpret experimental data. The appeal of the 
approach is its insensitivity to low-q “discontinuities”. We noted ear- 
lier that the low-q behavior of formfactors may substantially limit, 
sometimes perhaps prohibitively, the applicability of pseudopoten- 
tials to alloy theory. It should be noted, however, that the value 
of [402] is greatly reduced because only the model potential from 
[442] was considered. 
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The theory proposed in [401, 402] was elaborated in [420, 443-447} 
for the case of complex-structure alloys and was then applied to 
calculations of some alloy properties that are associated with the 
total energy. The first problem was to find the equilibrium lattice 
parameters and energetically favorable superstructures for Cd-Mg 
alloys. The model potential of [442] was used, its parameters being 
determined by setting to zero the potential formfactor and the deriv- 
atives of the energy with respect to the volume and to the axial c/a 


V(k),Ry 





Fig. 7.7, The Fourier transform of the alloying potential of the Ca-Ba a loy 
against vector k along the [100], [444], and [410] directions in the first Brillouin 
zone. 


ratio. Consequently, there were three equations for the two parame- 
ters to be fitted so that it was possible to obtain the equilibrium 
value of c/a by considering it as the third parameter. The exchange 
correlation corrections were taken from [448]. 

The total energy calculations for the pure metals Mg, Cd, and Zn 
indicate, in agreement with experiment, that all three must be HCP. 
The computed equilibrium volumes were also close to those mea- 
sured. As for the c/a ratio, however, a satisfactory result was only 
obtained for Mg. For Cd and Zn the computed values turned out, 
as occurs in most papers, to be lower than the experimental ones 
(1.63 instead of 1.85-1.89). Calculations of mixing energy showed 
that it is negative for all compositions, in accordance with the 
observed unlimited solubility of the components of the Mg-Cd sys- 
tem. An energetically favored phase, however, could only be calcu- 
lated for the equiatomic alloy. 

Various applications of the above theory may be found in subse- 
quent papers in this series (see [447] for a review). For example, the 
configurational energy was calculated by Fuks, Zhorovkov, and 
Panin [442] for the disordered solid solutions Mg-Zn, Cd-Zn, Ba-Cd, 
and Cd-Mg. That the concentration dependence of the mixing energy 


Ch. 7. Pseudopctential theory of alloys. Structure stability 255 


is in most cases nonparabolic isan extremely important result show- 
ing that the widely accepted model of regular solutions is rather 
inaccurate. In the first three systems the mixing energy was found to 
be positive, which suggests the existence of a low-temperature disso-- 
ciation of these alloys (which is indeed observed). 

It is pointed out that generally each term of the total energy 
contributes to the mixing energy, and it is therefore impossible to: 
form a criterion for predicting the sign of AU. Although this con- 
clusion is somewhat at variance with [434, 435] it should be born. 
in mind that in these two papers the availability of a criterion was- 
only verified for a very limited material and remains therefore: 
open to question. 

Another interesting conclusion to be found in these papers is that: 
the energy parameters may change both during alloying and during 
ordering, if for no other reason than because of changes in specific 
atomic volume. Fuks, Zhorovkov, and Panin also confirmed the re- 
sult obtained in [436, 437] that a rise in AU narrows the existence: 
range of intermediate phases. The demonstration of the possibility 
of calculating the mixing and ordering energies, and the equilibrium: 
volume of an alloy has general importance [420, 443-447]. 

It is worthwhile noticing here that the authors of [443-447] are- 
quite right in emphasizing the difference between the mixing energy: 
and the ordering energy. They define the former as the energy differ- 
ence between the alloy and its components and the latter as that 
between the ordered and disordered phases. It is pointed out in {447} 
that AUnix and AU,,, may even differ in sign. In Table 7.1 we 
list the results of [447] for the mixing and ordering energies and for- 
the equilibrium volume of the Mg-Zn system at normal conditions. 
and at a pressure of 5 X 101° Pa. 

Table 7.1. Equilibrium volume, mixing energy and ordering energy for the 


B 19 superstructure of Mg-Zn (upper row [for no pressure, lower 
row for 5x101 Pa) 








Sve | 0 | 0.1 | 0.3 0.7 | 0.9 | 1.0 


0.5 
Q (a.u) 101.9 107.4 118.4 129.3 140.0 150.0 156.0 
74.9 77.8 33.6 89.1 94.4 99.5 102.0 
AU mix — 3.12 6.64 7.23 5.64 2.32 — 
(10-3 Ry) — 2.74 6.30 7.40 5.03 2.21 — 
— AU ora — 0.24 1.63 3.96 1.25 0.12 — 


(1078 Ry) — 0.28 2.41 6.35 2.17 0.22 — 
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It can be seen from the table that the ordering energy and the 
‘mixing energy differ in sign and that uniform compression increases 
the magnitude of the former while decreasing that of the latter. 
‘The implication is that systems of this kind may show ordering at 
‘high pressures. 

We may conclude that the papers we have just reviewed demon- 
strate the usefulness of the pseudopotential method for the theory 
‘of metallic alloys or, more specifically, the usefulness of local 
pseudopotentials as treated in second-order perturbation theory. 

It is of interest, in this connection, to mention an application of 
higher order perturbation theory to a structure stability analysis of 
the binary intermetallic systems LiAl, LiTl and NaT1 [471]. A second- 
order total energy calculation predicted (see Table 7.2) that LiAl 
and LiT] should be stable in the #2 superstructure while NaTl must 
exist in a two-phase state (note parenthetically that when the exchange 
‘correlation effects are neglected, NaTl will also crystallize in the 
#2 structure). Experiment shows, however, that LiTl is stable in 
‘the @2 structure while LiAl and NaTI are stable in $732. So a second- 
order approximation is only satisfactory for the Li-T] system. 


Table 7.2. Total configurational energies of LiAl, LiTI, and NaTl, second- and 
third-order perturbation theory results 


Structure Be Br 








Ubs Ups | cov) | uR + u | UES | uf UID? + yEy) 
) DS JS > S S 

‘LiAl —0.0027 —0.0013 —0.0040 0.0004 —0 .0049 —0 .0045 

NaTl —0.0013 —0.0079 —0.0092 —0.0011 —0.0108 —0.0119 

LiTl —0.0048 —0.0017 —0.0065 —0.0003 —0 .0029 —0.0032 


A third-order approximation, as Table 7.2 shows, yields a far 
better agreement with experiment. True, the computed “ordering” 
temperatures differ substantially from those observed, but as a 
whole, there is no doubt that including the higher orders improves 
the results. 


7.3. Short-range order problems 


There is almost always some degree of short-range order in solid 
solutions, the nature of which is determined, in each particular case, 
by the way the atoms of the different species interact with each 
‘other. It would be desirable to be able to predict the sign and 


Ch. 7. Pseudopotential theory of alloys. Structure stability 257 


the type* of the order from the properties of the components. 

The problem is not only of purely academic importance because 
short-range order greatly influences [412] the thermal, mechanical, 
electrical and other physical properties of alloys. In particular, 
short-range order causes the anomalies in electrical resistance and 
some other properties found when annealing hardened and deformed 
alloys (the K-effect). Short-range order as opposed to long-range order 
may occur in alloys of any composition and is very often necessary 
in order to explain the changes in physical properties during pro- 
cessing. 

The early works on the electron theory of short-range order either 
employed the free-electron model [412, 449] or, when using the 
pseudopotential method, only the internal energy was analyzed [398, 
399, 339-441]. Some short-range order effects were qualitatively 
interpreted in this way. 

In further works on short-range order [404, 421-423] the minimiz- 
ing of the free energy was combined with the use of Eqs. (6.79)- 
(6.81) which yielded the parameter a, from the charges and pseudo- 
potentials of the ions of pure components. Other refinements in 
[422, 423] and subsequently were (i) the inclusion of the Ewald 
correction term in the electrostatic energy; (ii) the calculation of 
V (pọ) for the sum of the band and electrostatic energies rather than 
separately for each of them; this approach reducing errors due to low-g¢ 
discontinuities; (iii) an analysis of the exchange-correlation correc- 
tions [164]; and (iv) an analysis of the computational proce- 
dure [458]. 

Since short-range order is found in many noble-metal and transi- 
tion-metal alloys, it was with these alloys that the above papers were 
mainly concerned. Calculations were performed using the Animalu 
pseudopotential [405, 456] (for transition metals, [443, 455]), which 
has been defined and whose formfactors have been found [456] for 
a large number of elements. One reason for this restriction to this 
pseudopotential was to make the interpretation and comparison of 
results easier. 

The first objective of this series of papers [423, 450] was to check 
the predicting ability of the theory using as an example three alloys 
of the pentavalent metals V, Nb, and Ta. Experimental data on 
short-range order were only available for the systems Nb-Ta and 
V-Nb and it was known that despite their similar electronic struc- 
tures the two alloys differed in the sign of the parameter a,. There 
had been no explanation for this fact and hence no reasonable guess 
could be made at the possible sign of a, in V-Ta. The calculation 


* The sign of short-range order is that of œ, or, in case g= 0, of a,; the type 
is determined by the spectra of the parameters a, their ratios, and the alterna- 
tion of their signs. 
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yielded the correct sign for a, in V-Nb and Nb-Ta (negative and 
positive, respectively) and predicted a negative sign for V-Ta. This 
prediction was later confirmed by experiment [450]. 

This investigation was followed by the calculation of a, for more 
than forty other systems [451-454] for which the pseudopotentials 
and the (experimental) signs of the short-range order parameter œr 
were available. The important aspects of these results are as follows. 

It was shown that, in accordance with qualitative arguments about 
the asymptotic behavior of V (pọ), it is quasi-oscillatory for all the 
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Fig. 7.8. The ordering potential of the Ni-Pt alloys against the interatomic 

separation. The curves 1, 2, and 3 correspond to 11, 34, and 50 at. % Pt, respec- 

tively. Vertical lines mark p, values for each alloy, and A, B and C are the cor- 
responding values of V (@;). 


systems studied. This is clearly illustrated in Fig. 7.8 for the Ni- 
Pt system. These oscillations are not purely Friedel ones [430, 168}. 

It can be seen that the specific value of V (p) is dependent on 
composition even though its general behavior is not. 

If we look at V (pı) for the three curves (p, being the radius of the 
first coordination sphere) we see that V (p,) increases with Pt content, 
at least from 411 to 50 at.% Pt. This behavior agrees reasonably well 
with experiment [459] thus disproving the earlier conclusion of 
statistical thermodynamical alloy theory that the ordering potential 
is only determined by the nature of the components and is indepen- 
dent of their concentrations. 

The above results also show that any change in structure, if accom- 
panied by a change in interatomic separation, may affect the short- 
range order. Obviously, V (pọ) and a (p) are the most sensitive to 
separation where small values of V (p) are combined with large 
derivatives dV/dp. 


id 
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The factors that may affect short-range order via interatomic 
separation are temperature, pressure, and defect-producing treat- 
ments, whether thermal, mechanical, or due to radiation. The direc- 
tion in which the order changes depends on the sign of the derivative 
dV/do and the value of V (pọ) itself near 9,. 

Since plastic deformation, defects, or radiation treatment may 
increase the separations in some regions of the crystal and decrease 
it in others, the ordering potential may vary throughout the crystal, 
thus giving rise to non-uniform short-range order (local order, [412]). 
This allows a pseudopotential interpretation of some experimental 
data [412, 460-467]. For example, the systems Ni-Pt [459], Cu-Al 
[460] and Cu-Au [461] have V (pọ) that varies with temperature (in 
copper-containing systems the effect can be measured directly at 
the investigation temperature). 

It was pointed out in [467] that besides kinetic factors, non-uniform 
short-range order may also result from differences between ordering 
potentials in the vicinity of defects. This may he ascribed in the 
present context to features of the indirect interaction via the conduc- 
tion electron gas. Unfortunately, the agreement with experiment is 
as yet far from being satisfactory, even in qualitative terms. As far 
as the parameter «, is concerned, the predicted sign turns out to be 
correct in only two out of three systems studied. The agreement with 
experiment is better if the alloy components have the same valency. 
Therefore the sign of a, for a given system may only be predicted 
in terms of probability. Curiously enough, there is no agreement 
with experiment for alloys composed of noble metals alone or in 
alloys of transition or noble metals with aluminum. Considering 
the nature of these elements, one reason for this may be the inade- 
quacy of Animalu pseudopotentials. It should be noted, however, 
that there are more successful calculations than when the free- 
electron approach of [468, 469] is used. 

In all papers cited, the a, parameter was calculated using pseudo- 
potential theory. In [470] the short-range order parameters of the 
alloys Ni-Fe and Ni-Pt were calculated for the first three coordina- 
tion spheres from the combined formulae (6.76), (6.77) and (6.82). 
Using second-order perturbation theory, the computed parameters 
4-3 had the right signs (except for the cases when their experimen- 
tal values were within the measurement accuracy), but the values 
themselves were rather unrealistic as was the case when evaluat- 
ing a, alone. The agreement was improved, though, by using a third- 
order approximation. 

The second- and third-order results are summarized in Table 7.3 
for a, and in Table 7.4 for a, &a, and &3. 

For most of the alloys the inclusion of the third order improved 
the agreement between theory and experiment. For the Ti-Zr system, 
for example, it provided the right sign of a. For Cu-Au, Ag-Au and 


17* 
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Table 7.3. Paramcter a. Second- and third-order results and experiment 














Alloy | ay? | ai? ay? 
Al-Zn (10 at. % Zn) 0.04 0.04 0.08 
Mg-In (10 at. % In) —0.04 —0.08 —0.08 
Ni-Pt (31 at. % Pt) —0.03 —O.11 —0.135 
Ni-Pt (11 at. % Pt) —0.002 —0 .032 —0.013 
Ni-Fe (25 at. % Fe) —0.040 —0.055 —0.098 
Co-Pt (50 at. % Pt) —0.06 —0.18 —0.14 
Ni-Pt (50 at. % Pt) —0.04 —0.16 —0.18 
Cu-Au (50 at. % Au) 0.28 0.45 —0.16 
Ag-Au (20 at. % Au) 0.16 0.07 —(0.17 
Ti-Zr (50 at. % Zr) —0.26 0.24 0.05 
Zr-Hf (50 at. % Hf) 0.027 0.030 0.04 


Ti-Hf (50 at. % Hf) —0.43 —0.12 0.02 


Table 7.4, Parameters @,, @o, and a, for Ni-Fe and Ni-Pt alloys (second- and 
third-order results) 


























Alloy | a? as”? | as”? | at? | ap 
Ni-Fe (25 at. % Fe) —0.035 0.017 0.004 — 0.050 0.020 
Ni-Pt (14 at. % Pt) 0.003 0.018 0.045 —0.033 0.019 
Ni-Pt (31 at. % Pt) —0.017 0.031 0.034 — 0.128 0.080 
Ni-Pt (50 at. % Pt) —0.042 0.035 0.033 —0.239 0.196 
Alloy | a”? | afXP aeXP aSXP | 
Ni-Fe (25 at. % Fe) 0.009 —0.098 0.4116 — 
Ni-Pt (141 at. % Pt) 0.014 —0.013 0.06 0.02 
Ni-Pt (31 at. % Pt) 0.044 —0.13 0.14 0.06 
Ni-Pt (50 at. % Pt) 0.086 —0.18 0.31 —0.01 








Ti-Hf, even though the sign of a,” was incorrect, its magnitude was 


moved in the right direction. There is an improvement for Mg-In, 
Ni-Pt, and Co-Pt too. 

Table 7.4 [470] shows that third-order calculations also represent- 
ed an improvement for the order parameter of the higher coordina- 
tion spheres. 
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Although the above results thus demonstrate the usefulness of the 
third-order corrections, there is every reason to believe that for 
some alloys at least, especially for those composed of polyvalent 
elements, other factors will be found to improve the theory. To 
begin with, the large magnitude of the third-order corrections seems 
to necessitate the inclusion of the fourth and probably even higher- 
order corrections. On the other hand, obtaining more adequate 
pseudopotentials and finding the real shape of the Fermi surface 
are becoming more essential. The role of these factors in the theory 
of short-range order is discussed in [432, 449, 472-475, 500]. 


7.4. Crystal structure stability 
in the OPW approach 


In order to obtain phase stability boundaries, the total energy as 
found in Sec. 6.6 should be substituted in the thermodynamic poten- 
tial H -- U + pQ — TS. OPW calculations of crystal structure 
stability were performed by Hafner for disordered alloys of the Li-Mg 
system [428] and for solid solutions and complex-structure com- 
pounds [476]. 

Hafner [477] started by calculating the characteristic functions 7 
and A¥, and the specific volume, and from these obtained the con- 
centration dependences of the internal energy (for the FCC, BCC and 
HCP structures), the equilibrium atomic radius, cohesion energy, 
isothermic elasticity modulus and the equation of state. The calcula- 
tions were performed throughout the entire range of concentrations 
in steps of 10%. The elasticity modulus and equation of state were 
obtained by a technique similar to that used in [477] for pure metals. 

Comparing his results with experiment shows that the computed 
atomic radii are correct to within 2 or 3% for pure Li and pure Me. 
In solid solutions they depart substantially from Vegard’s rule, 
which agrees with the experimental departure in sign (toward the 
lower values) but is by a factor of two larger. It is interesting that 
at concentrations for which 2k, is a reciprocal lattice vector an 
inflection-type singularity was observed in [477] as well as in [478]. 
The higher-pressure form of the equation of state also argees with 
experiment both for the pure elements Li and Mg and for their 
alloys. The predicted elasticity moduli are correct to within 25%. 
Of considerable interest is the calculation of the equilibrium crystal 
structures. 

Figure 7.9 is reproduced from [428] to show the energy differences 
of the FCC and HCP structures relative to the BCC structure. The 
arrows mark the concentrations for which the Fermi sphere comes in 
contact with the Brillouin (Jones) zone boundary. It can be seen 
that for Li-rich or Mg-rich solid solutions, the preferred phase is 


262 Part &. Pseudopotential theory in crystal-structure calculations 


HCP, for 35-80 at.% Mg it is BCC and for 15-35 at.% Mg it is FCC. 
With the exception of the FCC phase, which only exists in a metasta- 
ble state, the existence ranges predicted are close to those observed. 
Phase-to-phase transition regions (corresponding to sharp changes in 
energy differences) are close to Fermi-sphere/Brillouin-zone-houndary 
contacts. The reason for the energy changes at these points is, as 
we have already discussed, the steep slope of the screening function. 


AU.107? Ry/ion 





Li c Mg 


Fig. 7.9. HCP/BCC (bold line) and FCC/BCC (dashed line) energy differences. 
The concentrations marked by the arrows are those for which the Fermi surface 
touches the Brillouin-zone face (1—HCP, 2—FCC, 3—BCC). 


Figure 7.10 shows the concentration dependence of enthalpy, 
AH (c) = Uri-mg (c) — (1A—c) Uy — cUxg for a number of phases 
[428]. It can be seen in the region from 8 to 53 at.% Mg that the 
AH versus c curve is convex, which indicates that the mixture of 
BCC and HCP phase has a lower enthalpy than each one of them 
taken alone. Accordingly, the FCC phase cannot be stable in this 
concentration region even though its energy is the lowest. Under 
some circumstances, however, it is because of the lower energy that 
a metastable FCC phase may occur. The results of the calculations 
are in fairly good agreement with experiment and so provide good 
evidence for the adequacy of the theory. Hafner seems to be quite 
justified therefore in his conclusion that, given a reasonable choice 
for the pseudopotential, second-order calculations may be reliable. 

The next paper of the series was concerned with the formation 
enthalpies of alkali intermetallic compounds with stoichiometry 
AB,, A,B, and A,B; and the structure classified in [479, 480] 
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and their solid solutions. To gain more insight into the binding 
mechanisms in these systems, the coordination numbers of atoms 
were varied. The comparison of the band and electrostatic contri- 
butions, which required the determination of the effective interatom- 
ic potentials between the A-A, B-B and A-B pairs was basic to 
the study. 

For the solid solutions Li-K, Na-Cs, Na-Rb, Li-Na, Na-K, K-Rb, 
Rb-Cs, and K-Cs it was found that the predicted mixing enthalpies 


AH, 1073 Ry/ion 





Li c Mg 


Fig. 7.10. Formation heats of the BCC (bold line), HCP (dash line), and FCG 

(dash-dot line) phases of Li-Mg alloys as functions of composition. Common tan- 

gents show the existence ranges of the phases. Crosses show the experimental 
enthalpies of the liquid alloys. 


are consistent with the [lume-Rothery rule, which limits the mutual 
solubility of components with atomic dimensions differing by more 
than 15%. Furthermore, low entropy (that is, almost perfect) solid 
solutions very closely follow the Vegard rule in their concentration/ 
lattice-period dependence. In particular, this is true of K-Rb, Rb-Cs, 
and to some extent K-Cs. 

_The complex-structure phase calculations in [477] were the first of 
the kind and as such are of interest. We will limit our discussion to 
the results for three Laves phases AB,, namely, the cubic @15 (like 
MgCu,), hexagonal @14 (like MgZn,) and multilayer G36 (like 
MegNi,). Electrostatically, the most favored of these is the cubic 
phase. If, however, the band energy is also considered, then, in 
accordance with experiment, the hexagonal structure becomes stable. 
The reason is that the relatively shorter reciprocal lattice vectors 
make the characteristic function of this structure larger than that 
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of the competing structures. It is interesting that the greatest con- 
tribution comes from the interaction of like neighbors (AA, BB), 
whereas the AB-type interaction is about the same for all the struc- 
tures. Note also that in this paper the parameters of, say, the AA 
interaction, turned out to depend on the second component (B). 

Other properties were calculated, such as the formation enthalpy, 
the coordinates of the atoms, and the equilibrium values of the 
parameters a, c and c/a. These results together with the corresponding 
experimental values are summarized in Table 7.5. We can see a fair- 
ly good agreement between theory and experiment. 


Table 7.8. Equilibrium values a, c, c/a of atomie coordinates x and z. and 
formation enthalpy AH. Theoretical and experimental results for the Na,K 
and K,Cs intermetallic compounds [477}. 


Nagk KeCs 
Phase characteristics . 
theoretical | experimental | theoretical exe 


a (10-19 m) 7.62 7.50 9.56 9.07 
c (10-19 m) 12.79 12.31 16.26 14.76 
c/a 1.68 1.62 4.70 1.63. 
z (fractions of a period) 0.0635 0.0625 0.064 0.0625 
z (fractions of a period) —0.1675 —0.1667 —0.167 —0.1667 
AH (J g7? atom~!) —522 —605 —751 <0 


Figures 7.14 and 7.12 show the potentials of interaction between 
various pairs of atoms as calculated in [477]. It is easy to see that 
in the Laves structures of K,Cs and Na,K the minima of these poten- 
tials are in better agreement with the measured separations R than 
they are in the solid solutions Na-K and K-Cs. It was pointed out 
in [477] that the same situation occurs in other structures, for 
example in K,Rb,, K,Cs,, and Rb,Cs,. 

The reader may have noticed an inconsistency in our discussion. 
In Sec. 7.2 it was argued, with the reference to [409, 471], that a reli- 
able calculation requires third- or even fourth-order perturbation 
theory. In the present section we found it permissible to work only 
to the second order. However, there is no contradiction because the 
potentials employed in [409, 474] and [428, 477] were different. If, 
on the other hand, we assume the potential is good, it is not yet 
clear whether second-order perturbation theory will always 
do or whether in some cases it will not. 

Whatever may be the case, the OPW theory has proved very suc- 
cessful in stability studies, even for complex structures like the 
Laves phases. This, undoubtedly, will encourage further applications 
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Fig. 7.11. Interatomic potential for the K,Cs alloys. The arrows indicate R 
values for solid solutions and for Laves phases. 
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Fig. 7.12. Interatomic potential for the NaK alloys. The arrows indicate Æ 
values for soild solutions and for Laves phases. 
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-of the theory. It is perhaps worth noticing that apart from the differ- 
‘ences we have mentioned, this present section has an important 
point in common with Sec. 7.2. Namely, a good agreement with 
experiment is obtained through the use of nonlocal potentials, wheth- 
«r the nonlocality is inherent or arises from the use of higher-order 
perturbation theory. This finding implies, of course, the importance 
and necessity of nonlocal potentials. However, local approximation 
ds numerically much more convenient to use and must not be 
abandoned at present. 


Chapter 8 


Pseudopotential theory 
and imperfections in crystals 


8.1. Introductory remarks 


The single- and two-component crystals we studied in the preced- 
ing chapters were very idealized in that their constituent atoms 
were assumed to be fixed in the positions characteristic of a perfect 
lattice. There are two kinds of imperfections in areal crystal. Dynam- 
ic imperfections are the displacements of the atoms as a result (or 
rather manifestation) of thermal motion. Static imperfections 
include vacancies, interstitial atoms, dislocations, stacking faults, 
etc. Impurities are also ordinarily classified as static imperfections; 
this in fact limits perfect crystals to single-component ones. The 
disturbing effect of an impurity is two-fold in nature. First, it sub- 
stitutes a host atom at the atom’s site; this effect was already con- 
sidered in the two preceding chapters. Second, an impurity displaces 
neighboring host atoms from their equilibrium positions. 

In this chapter we are concerned with the effect of imperfections 
on the crystal energy as calculated in pscudopotential theory. This, 
however, is too vast a subject to be covered fully in a single chapter. 
Moreover, some aspects of the problem have been discussed by 
Heine et al. [17] and Harrison [73]. Our task here is only to outline 
the principles and to show a few examples of their application. 


8.2. Crystal lattice vibrations 


Remember that the crystal energy is the sum of the structure- 
independent term U, plus the structure-dependent term, which is 
in turn the sum of the band-structure energy Ups and electrostatic 
energy Ues. Thus far we have considered the crystal energy at the 
limit of no thermal vibration. Our purpose in this section is to 
lift this restriction. It should be noted that in doing so we remain 
within the framework of the adiabatic or Born-Oppcenheimer 
approximation. We will start our discussion with the band-structure 
contribution Ups. 

First, we must rewrite the formfactor I°? (q) of the crystal pseudo- 
potential. For the sake of simplicity, we use the local approximation. 
and we assume there is only one atom in the unit cell of the crystal. 
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We have 
f e- iktor? (r) e" dr, (8.1) 


o| > 


I or (q) = 
where 


Wè) =- >) W’ (r—t, —uy), (8.2) 


Yy 


u, being the displacement of the v-site atom from its equilibrium 
position. Substituting (8.2) into (8.1) gives 


a Í -i u ) 
W ex (4) = Fe Dy WE (gq) eM, (8.3) 


Vv 


If we assume the atomic displacements are small, it is legitimate 
to expand the exponential in terms of displacements and retain 
the quadratic terms, 


or 1 7 ~i . 1 | 
Wor (G) =- >, W (q)e {1— iqu, — 5 (quy)?+ -. z (8.4) 


v 


The quantity we shall actually need in the following is 


[Wer (= S emiatt £4 — iq (uy — uy’) 


vv! 


— -5 lau uy) h} (8-5) 


Since the average value of u, is zero, we may write 


b 2 : A 
wè p= LOE S eniai [1—5 la (u u)?}. (8-6) 
vv’ 

It is now convenient to transform to normal modes, which means 
that instead of the system of vibrating atoms, we shall be dealing 
with a set of waves (modes). The amplitudes of these are called 
normal (model) coordinates and we shall see later that they depend 


on the wave vector. We have 
N 3 
u= > È 2agee cos (a t — Q;ty —SQ,); (8.7) 
Q=1 j= 


where 2ag; is the amplitude of a wave having the wave vector Q 
and polarization j; eg; is the unit vector for the direction in which 
the atom is moved by the wave (for longitudinal vibrations, eq; ll Q; 
for transversal, eg,;_1 Q); and 5g, is the phase of the wave. 
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When evaluating (u,— u,-)?, we encounter terms of the form 
cos? (Wat — Qty —da,), cos? (wg, t— Qirtw — 89.) 
and 
2 208 (Wg jt — Q jt, — ôg) cos (git — Qty: —5q),) 


The quantity (u,— u,-)? is the superposition of a large number of 
waves and can be easily estimated by averaging each of its con- 
stituent terms. 

The average value of cosine squared is 1/2 while for the product of 
cosines the result depends on whether the pairs Q, j and Q’, j’ are 
equal or different. In the latter case, the corresponding waves are 
independent and the average of the product will be zero. If Q = Q’ 
and j=j, 








2 cos (Wg t — Q ty — ôo) cos (Q,t — Q;t,--— 5o,) 
= cos [20 t — Q; (ty + tv) — 259] cos Qj (ty—ty), (8.8) 
where the last term is of course time-independent. 
Thus 
[4 (ty — uv:)]? = 4] qageg,2 [1 —cosQ; (ty—ty)]- (8.9) 


The same result can be derived more rigorously, but we shall not. 
do so here. Substituting (8.9) into (8.6) we find 





1 —ig(t, -tyr 
IW? (a) 2 = ar | WEG)? De My? 
x {1 — ` py 2 | qeq,2a, |2 [1— cos Q; (ty— ty) }. (8.10) 


Q 3 


It is easy to sce that |°? (q) |? consists of two terms. One, 


1 ‘ ~igq(t., — tay: 
| We (a) li = spe |W? (a) 12 D>) oY, (8.11) 
yV’ 





corresponds to the case of a stationary crystal which we have. The 
effect of thermal motion is described by the second term which is 


a 1 aate 4, 
|W. (A) [ayn = yr | W? (a) [2 Dy emitt 
vv 


x {> > 2\q, €Q,29; |? [cos Q; (ty—ty)— 1}. (8.12) 
Q ij 
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Writing the cosine as a half sum of exponentials of imaginary argu- 
ments +iQ; (ty— ty) we obtain 


5 1 
Y cr dyn == Are ” (q) |? > >) | q, eQ.aQ, |? 
| We, (q) | ye | W jI; 
Q j 
x 5 fe TETP L eTit lyt _ da Faby yy 


vv’ 


= |W? (a) 12 Dy Dy Dd) {ôa oj tty | (En + Qj) co ag, |? 
Q jn 


+ ôq, £,-Q; | (En —Q;) eo ;2Q; |2— 25g, Bn | Ene; 2}. (8.13) 


When substituted into (6.1) this gives the Q wave contribution to 
the vibrational band structure energy, 


8U 5s (Q) = = > 23 t (En Q) eo; aq, |? B (gn +Q;) 


+ (gn — Q;) €Q ;2Q, | P (g,— Q;)} == = 5 > | £n&€Q j2Q; |? D (gn). 


n j 


(8.14) 


The total change in band energy clearly will be obtained by sum- 
ming Ups (Q) over all Q. In order to carry out the summation, it 
is of course necessary to know the way the amplitudes ag; depend 
on the wave vector and polarization vector. 

The effect of thermal motion on the electrostatic energy can be 
found in a similar fashion. We first find the total electrostatic energy 
and then exclude the terms corresponding to the stationary lattice. 
For the simple case we are considering (one atom in a unit cell), it 
is easy to verify that the contribution from the Q wave is 


| En tQ Neg e Q; |? a 


8Ue5(Q)= 22" 3 Taran m 
n j 


Ei l (En—Q)) €g 4 ,!° an 
J _______“J “j_ e 
| $n —Q; |? 


— 4nZ* a > 


It should be remembered that in both (8.14) and (8.15), the g, = 0 
contributions are dropped in the last terms. Clearly, by summing 
(8.14) and (8.15) we obtain the total Q-wave contribution to the confi- 
gurational part of the crystal energy. 


2 82 
| Enlg Le; | _ 


e in, (8.15) 
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We have thus demonstrated that, in principle, pseudopotential 
theory might well be used for determining the way thermal motion 
affects the crystal energy. In practice, the above expressions require- 
a knowledge of the wave amplitude, which makes their direct use- 
rather difficult. 

They can be used in a somewhat indirect way, however. Combin- 
ing pseudopotential theory with crystal lattice dynamics reveals- 
that the quantities 6Uy, (Q) + ôU.s (Q) may be quite useful in 
calculating phonon (or vibration) spectra, which can be measured 
quite reliably by modern techniques. It is therefore worthwhile to- 
outline here the basic principles of lattice dynamics. 

The potential energy of a crystal may always be represented as- 
a power series in atomic displacements. 


V=Vot >) 4-5], 


ou? 
vlj vl 





: +, 2 7 
+4 > why |] + eey (8.16) 
ww ll, ji Bayt Oy 
where v and l are the indices of, respectively, an atom and cell, j is 
the component of the displacement vector, and the subscript 0 
corresponds to the state of equilibrium. The ground-state potential 
energy, Vo, can be always set to zero and thereby made a reference 
point. Since the potential energy has a minimum at uj, = 0, it 
follows that [0V/duZ;], = 0. The only remaining term in (8.16) is 
quadratic in displacements and describes the harmonic vibrations 
of the lattice atoms. This is the term which corresponds to 6U above. 
The kinetic energy of the lattice is 


1 e. 
Exm= Diy My | use’, (8.17) 
vlj 


where M, is the mass of the vth atom. Since the sum of the kinetic 
and potential energies is constant,. it follows that 


Myu i= = 2 Drar | ubr, (8.18) 


This equation holds for each j-component of the displacement vector 
of each vth atom in each lth elementary cell. Equation (8.18) is 
therefore a system of a huge number of coupled differential equations 
and there is no hope of solving it directly. 

By classical mechanics, the quantities 


CH= [= |, (8.19) 


J j 
üy duy 
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-are the components of the force tensor. Since all the crystal cells 
aare identical, the G tensor does not depend on the values of Z and I’ 
but rather on their difference A. We thus have 


Gi ww = G4, (h) (8.20) 
which, when substituted into ue 
Muti © GR (h) wh ane (8.21) 
vA, 


“This expression is invariant under translation and hence satisfies 
the Bloch theorem, 


ubi (t) = cf uf, (t). (8.22) 
‘Substituting this into (8.21) gives 
Myc = —Y (NG, (h) D} ui cil, (8.23) 
vi h 


‘The expression within the curly brackets is the Fourier transform 
-of the force tensor G. Taking e?2! out of the sum and canceling it, 
we find 


Af 


Mui = — > Gi (Q) ut, (8.24) 
Vd 


where y (Q) is the Fourier transform for 6°V/dui, Oui, ». Since the 


Gi), are functions of Q, so too are the amplitudes of the thermal 
(elastic) waves and we may write 


Wi, = ujo. (8.25) 


It follows from the theory of oscillations that (8.24) may be trans- 
formed to 


> {Gy (Q) — oM vyr} ubo =0, (8.26) 
VJ 


which is a (homogencous) system of equations for the amplitude 


uj, g: This system can only have nontrivial solutions if the determi- 
nant of its coefficients is zero, which is the condition from which the 
frequency spectrum w (Q) can be obtained. But, as can be seen from 
the above argument, the sum ôUnps (Q) + ôU.s (Q) coincides with 
the quadratic term in (8.16) and therefore, the amplitude coefficients 
may be found by evaluating the Q-wave contribution to the change 
of the band and electrostatic energies. 

The number of equations in (8.26) is given by the number of pos- 
sible polarization directions (3) times the number of atoms in the unit 
‘cell. As this product is a factor NV smaller than the number of equa- 
tions in (8.18) (N beingthe number of cells in the crystal), the solution 
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of (8.26) becomes quite feasible. Strictly speaking, this system ought 
to be solved for all possible Q values (of which there are W). In prac- 
tice, it is enough to do this for a reasonable number of them, the 
solutions for the others being found by interpolation. As compared 
with (8.18), this approach requires far less numerical work. 

It follows from the above discussion that the dynamic matrix 
(8.26) may be found using pseudopotential theory. The calculation 
of its determinants then gives the phonon spectra of the crystal. 
These calculations are interesting in a number of respects. First, 
comparing the calculated and measured phonon spectra makes it 
possible to check the reliability of the pseudopotentials used and, 
if need be, to correct (fit) them. Second, the calculations provide 
information about the role electron-electron and electron-ion inter- 
actions play in constructing pseudopotentials. 

On the other hand, it sometimes proves possible, by comparing 
computed and measured spectra, to explain phonon characteristics 
which are responsible for atomic properties. 

Finally, pseudopotential calculations of the phonon contribution 
to the free energy (or, for finite pressures, to the thermodynamic 
potential) provide a necessary basis for solving fundamental solid- 
state problems suchas the calculation of phonon effects in electrical 
conductivity, the calculation of thermoelectric and galvanometric 
effects, and the compilation of phase diagrams. 

One application of pseudopotential theory to phonon studies is 
given in a paper by Animalu [455]. Using his own model poten- 
tial [343] for the transition metals, Animalu calculated the phonon 
spectra of some 3d, 4d, and 5d pure metals. As there is only one atom 
in the unit cell of a pure metal, the matrix in (8.26) will be (3 x 3). 
For the major axes of a cubic crystal we have 


2 11107853 — G7" (Q)] ea, = 9, (8.27) 
i 


where GIF is the dynamics matrix, which in this case may be writ- 
ten as 


G = Gi! + Gt + GU. (8.28) 
Ilere the first term describes the electrostatic energy, the second 


the Born-Mayer repulsion potential, and the third the band structure 
energy. For the vibration frequencies we have, accordingly, 


02 = og -+ OBM — Ws: (8.29) 


where 0g; = Gẹę M~! and so on. 

In actual calculations, the electrostatic contribution was written 
in a conventional form and the Born-Mayer term was dropped in 
view of the small size of the ions involved. The band-structure 
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term was written as 


y oou nZ? HEDO (QEZ g 
loh (Q = {> QERRE p (Q +gn) 


' ist 
— 3) 37 Oe)}. (8-30) 
nÆ) 
where 
@ (q) — | ——— e b 2 e(q)—t E 
(q)=|—g | Wa Pater: 830 
The results for FCC metals were found to agree reasonably well 
with experimental data on phonon spectra, at least for longitudinal 
waves (see Fig. 8.1). The agreement was less satisfactory for BCC 
metals. 
This, Animalu felt, might be the result of the local approximation. 
A more sophisticated phonon-dispersion calculation was attempted 
by Oli and Animalu [481] for the typical BCC transition metal V 


Frequency, 10" Hz 
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Fig. 8.1. Phonon spectra of (a) Cu and (b) Ag as derived from the Animalu po- 

tential [34 sf The longitudinal branch is marked by L, the transversal branches 

by 7, and T,, their respective experimental values are shown by the full cir- 
cles, empty circles and triangles. 
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They included nonlocal effects and the Born-Mayer repulsion and, 
when calculating the dielectric permittivity, they determined the sd 
hybridization using the resonance model of Animalu’s metal po- 
tential. 

The resulting w(Q) curves are plotted in Fig. 8.2. In the [100] 
direction, it can be seen that both local and nonlocal calculations 
agree well with experiment. In the [111] and [110] directions, the 
predicted longitudinal frequencies arerather too high and in the [110] 
direction the transverse frequencies are too low. That nonlocality 
allows better results for longitudinal waves may be a valuable hint, 
Oli and Animalu argue, towards an explanation of the nature of 
soft modes. The use of a nonlocal resonance potential also im- 
proved the results for Cu [302, 482] (see Fig. 8.3). 

We thus see that a comparison of calculated and measured spectra 
is indeed a good check on new potentials (see [73] for an extensive 
discussion on the subject). It should be noted, however, that phonon 
data alone may be insufficient for fitting, and information about 
other properties may be necessary [483]. 

So far we have mainly discussed phonon-spectrum calculations as 
a tool for testing pseudopotentials. They are also important for 
determining some of material properties. 

In [484] Animalu calculated the phonon spectra of the alkali-earth 
metals Ca, Sr and Ba and used them as a basis for the electron theory 
of phase transitions in these metals. At any finite temperature, the 
competing phases must have equal thermodynamic potentials (or 
free energies in the absence of pressure) for a transition to occur. 
Apart from the purely configurational contributions derived in the 
previous chapter for the stationary crystal, the phonon contributions 
to the internal energy and entropy may be of importance. The former 
is finite even at absolute zero, viz, 


1 . 
Opn = > Dy hogs (8.32) 
qj 


At finite temperatures, the phonon contribution to entropy is 


S ph = —ks >) In | 1—exp (-- ae )| 


q3 








+3 Me fox (HE) AT, (839 
qi 


and that of the free energy is 





hog: 
F n = kpT > In | 2sinh are |. (8.34) 
q? 
18* 
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Fig. 8.2. Vanadium phonon'spectra calculated by Oli and Animalu [481] from 
(a) a local pseudopotential and (b) a nonlocal pseudopotential. Bold lines—calcu 
lation. L, 7,, and T, denote the longitudinal and two transversal branches, the 
respective experimental values are shown by full circles, empty circles, and tri- 
angles. 
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Fig. 8.3. Cuprum phonon spectra calculated by Dagens [302] in local approxima- 
tion (dash lines) and with nonlocal effects (bold lines). Other notations are as 
in Fig. 8.2. 
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Using these formulae Animalu found the phonon frequencies (taking 
into account the band and electrostatic contributions) and obtained 
the phonon part of the free energy (Fpn)- He then calculated the inter- 
nal energy U = U, + Ues + Ups and the thermodynamic potential 


G (p, 1) =U + Fyn + PR. (8.35) 


The resulting phase diagram for Sr is shown in Fig. 8.4 together 
with the experimental phase diagram. The agreement is rather good 
considering the time of the study (note that virtually no fitting was 
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Fig. 8.4. Phase diagram for Sr: (a) experiment, (b) theory. 


carried out for the potential). Physically, the reason for the FCC-BCC 
transition is that the phonon spectrum of the BCC lattice contains 
a low-frequency transverse branch which markedly increases the 
phonon part of entropy at elevated temperatures. For Ba, too, the 
results are satisfactory, at least qualitatively, whereas calculations 
for Ca show rather poor agreement with experiment, presumably 
because of d-band effects. 

The role of thermal vibrations in the theory is not limited to the 
results we have cited. 

Regrouping the terms between ye. (q) js and IW? (q) jayn it 
can be shown that 


1 1 Li -ta 
| Wela) |? = ae | W’ (a) 2 D ei {1-2 X) | geg ao, |} 
vv’ Q,3 
1 -i =A, 
+ spr | W° (a) 1? X, 21 qeg aa, 12671? cos [Q; (ty — ty:)]. (8-36) 
vv’ Q; 
This expression has an analog in the theory of diffraction on a vi- 


brating crystal lattice [3, 485]. It is easy to see, that the term in the 
curly brackets in (8.36) corresponds to the Debye-Waller factor 
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eM where 


2M =2 2: | qeq,2q, |?- (8.37) 
J 


In X-ray analysis, this factor is known to damp down the X-ray 
maxima, thus reflecting the physical fact that the peripheral electron 
densily of the atoms is “smeared out” by their thermal motion. In 
pseudopotential theory we have, by the same token, 


| Wr (a) 2 =| Wo (q) [2e7%™, (8.38) 


where IV? (q) and W# (q) are the matrix elements at, respectively, 
zero and finite temperatures. The reader is referred to [3, 485] for 
a detailed discussion of the factor M. A few points worth mentioning 
here are as follows. 

At high temperatures (T >> O) we have, in the Debye approximation, 


m=3 Mea 


3 MatkpO? ’ (8.39) 


where ÆA a is the mass of an atom and © the Debye temperature which 
is usually between 100 and 600 K. In typical metals and semicon- 
ductors, the room temperature value of 2M is about (4-8) x 10-7? 
for the shortest g,. Note that 2M is very sensitive to the magnitude 
of the reciprocal lattice vector (as gł). Raising the temperature 
increases M both explicitly, through the factor 7, and implicitly, 
through the Debye temperature ©, which decreases with tempera- 
ture. The low temperature expression for M is rather involved, but 
it is greatly simplified in the zero-temperature limit: 


3 Agh 


MX S Marks’ 





(8.40) 


As this is only one fourth the Debye temperature M value, it follows 
that, in principle, a pseudopotential formfactor should always be 
corrected by a Debye factor of the form (8.38). Even though the cor- 
rection may be small, its possible effect must always be estimated. 

This result is very significant, for pseudopotential formfactors 
are essential in determining important properties of crystals such as 
the band structure and the temperature dependence of band gaps. 
It is worth mentioning, in this connection, a paper by Altshuller 
et al. [486] on the band structure of ionic-covalent compounds. By 
computing energy levels at a number of high-symmetry points, 
Altshuller and coworkers were able to calculate: (i) the band-gap 
temperature dependence of Si, Ge, GaAs, InSb, and GaSb, (ii) the 
p-T diagram for Si, InSb, and CdTe; and (iii) the Ge-Si phase 
diagram. The effect of temperature was taken into account by multi- 
plying the pseudopotential formfactors by a Debye-Waller factor, 


Ch. 8. Pseudopotential theory and imperfections in crystals 279 


the effect of pressure was allowed for by varying the lattice para- 
meters. 

Altshuller and coworkers concluded that the stability boundaries 
of semiconducting compounds could be found by determining where 
their indirect band-gaps have disappeared. For the compounds studied, 
the estimates obtained by this criterion agreed reasonably well 
with experiment. 

To sum up, phonon effects are readily incorporated into the 
pseudopotential theory of crystals and greatly extend its potentiali- 
ties in determining major material properties. 


8.3. Static imperfections 


These are (i) the substitution of one atomic species for another, 
(ii) the disappearance of an atom at one place and its appearance at 
another, and (iii) the displacement of atoms around a distortion site. 
A common assumption is that atomic cores remain undisturbed in 
a distorted crystal. This means that static imperfections could be 
regarded as “frozen” atomic vibrations which may be treated adiaba- 
tically (taking into account, of course, changes in interatomic separa- 
tion, average charge density, etc.). But then the effect of imperfections 
on the crystal energy may be reduced to a change in the structure 
factor, and the energetics of an imperfect crystal would reduce more 
or less to that for an alloy or vibrating crystal. An illustration of this 
is a single-component crystal containing substitutional point defects. 

Starting, once again, with the band structure energy, we first cal- 
culate the matrix element 7/2,(q) taking into account the displace- 
ment of ions about the defects and the change in the formfactors 
at the defect sites. Consider a crystal with Ne randomly distributed 
defects (c being the defect concentration). We have 


W esa (t) =p Dy ((1— ey) Wm (r— ty — uy) + ey (t, — u), 


(8.41) 


where the c, are the operators defined in Chapter 6, and W% (r — 
t, —u,) and W} (r — t, — u,) are, respectively, the host-atom 
and defect pseudopotentials at the point t, + u,. Taking the Fourier 
transform we obtain 

1 _3 =} 
Wer ala) =- X C- ey) Win (g) 0 E 4c Wa (q) e7), 


v 


(8.42) 


Let us now consider the energy change caused by Ne vacancies which 
do not displace their neighboring atoms (note that we are dealing 
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with a single-component crystal). We introduce the matrix ele- 
ments for the “average” and “difference” ions in the crystal 


W (q)=(1—c) Wh (q)+-0, AW?(q)=0—W2(q). (8.43) 
Using (6.5)-(6.13) we obtain in this approximation 
uae) 
| Wes, ala) 2=(1—0)? | Wha (a) |28q, ¢ + E | WE, (a) 2, 
(8.44) 


and for the band structure energy, 


Q ' n 
Unsa =e D Aao EEn | Wa (ga) l? 


n 





Qe (1 —c) ime (q) b = 
pE S g ERAN | waa) |, (8.45) 
q 


where in the first and second sums, respectively, the terms with 
g, = 0 and q = g, must be dropped. 

Because of the low defect concentration (e < 107?) it is not too 
crude to assume that the lattice period remains unchanged in the 
presence of defects, and we may write 


1—e (gn 
Uys, a — Uns, pert = -> > "(= 204-02) g En) | Wha (En) l? 


Qc (1— ” 1 
$e S g ERA | Wala) (8.46) 
q 


Since the number of terms in the second sum is N times that in the 
first, the two sums must be of the same order of magnitude. Further- 
more, since the sums are opposite in sign, the sign of Ups, a — 
Ubs, pert May only be determined by an actual calculation for 
a specific model. Needless to say, the calculation should include 
the change in the average atomic volume and related parameters. 
There is another interesting point about equation (8.46). In its 
first terms, the wave vectors of the initial and final electron states 
are allowed to differ by a reciprocal lattice vector, in the second 
term they may differ by any reciprocal vector. Hence the first term, 
which differs from its perfect-crystal counterpart by a constant 
factor (—2c + c?) will not contribute to electrical conductivity, 
thermo-emf, or other transport properties, and the residual resistance 
of an imperfect crystal will be determined by the second term 
alone. It has been shown [17] that the electrical resistivity of some 
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materials is qualitatively described by the formula 


2 
p= 3am 2 | | s(a) [2 | W (a) [22° dz, (8.47) 
0 





where x = q/2kp. 

Our next step is to include atomic displacements. The point of 
departure will be equation (8.42), in which the displacement u, may 
be expressed as a sum of contributions from all the defects, 


u=) Cy'Åy-ys (8.48) 
7 


where the displacement function A,_,, depends on the distance 
between the displaced atom (at the vth site) and the disturbing 
defect (at the v’th site). Since (8.48) is the convolution of the functions 
c (r) and A (r), it is convenient here to take their Fourier transforms. 
The Fourier transform of a convolution of two functions is the prod- 
uct of their Fourier transforms, hence 


Uo =CoAo. (8.49) 


Here Cg is found from (6.9) and the coefficients Ag are obtained [418] 
from the equilibrium condition with respect to displacements. 
Treating a crystal as an isotropic elastic continuum, 
— ite 1 WV Q | 
Ag= 3 (1—0) V de Q? ’ (8.50) 
where o is Poisson’s modulus and (1/V) (dV/dc) describes the depend- 
ence of the specific volume of the crystal on defect concentration. 
The expressions for Ag become more complicated in anisotropic elas- 
tic crystals and still more so in microscopic theory. For the [100] 
direction in an FCC lattice, 
1 | al av Qa 
Ag=5 d (cu -1 2C42) Ca (cot | Q, (8.51) 
where c; and ¢,, are the elasticity moduli, and d is the lattice 
Parameter. 
A knowledge of the Ag makes it possible to determine the displace- 
ment of an ion as a function of the distance to the defect (impurity 
atom), viz. 


uy (t)= +> >) Agsin Qr. (8.52) 
Q 


The mean square of static displacement is 


u=) | co | Ad. (8.53) 
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For a random distribution of defects, 


u= -r c (1— c) M°, (8.54) 
where M° = b (1/V)? (dV/dc)*. If only nearest-neighbor interactions 
are included and noncentral forces are considered (the Born-Begbie 
approximation, [487]), the value of b is about 0.25-0.45 in typical 
metals Ag, Al, Ni and the like. 

We consider next the contribution from atomic displacements to 
the square of the modulus of the matrix element, | 7/72, a (q) I. 
By analogy with the effects of thermal vibration, we expect the 
result to contain two parts. The first one may be written as 


| Wera (q) |? = | W? (q) |2 est (8.55) 


q. gn’ 


where Msı = (1/2) qu,)? and the g; are the reciprocal lattice vectors 
for a crystal distorted by defects. 

The band-structure energy associated with this part of the ma- 
trix element is 


1 i 1— n u , — 27] 
Uis, a= 2 e D (e w) Fee | W° (gh) |2 e77™st 6, ge. (8.56) 


It can be seen from (8.55) and (8.56) that if crystal defects displace 
atoms from their perfect positions, the interaction of the electron 
gas with the defects alters the electron-transition matrix elements. 
If Q, is little affected by defects, the absolute magnitude of UR 
will be less than that in a perfect crystal. Since Ups is negative the 
atomic displacements will have, through Uys, a somewhat desta- 
bilizing effect on the lattice. It is implied, of course, that, in an 
actual calculation, the electrostatic contribution is also included. 
This will be achieved if, in (8.56), we replace 


Q; (gn)? 


7 1 Br 27 ~re = 
Ba (fn)? 72+ | Ww? (Zn) | 2 by KACA Z*2e an, (8.57) 
The second part of the square of the modulus of the matrix element 
is given by 


| WAR (q) =F | W (a) [2 {| Ag (En +Q) [ôa 6,40 
| Ag (gn —Q) 128g, ¢,-0} (8-58) 


By substituting this into the formulae for the band structure energy, 
we find the change in this energy, the sign of which will coincide 
with that of Uys. Therefore, both the magnitude and sign of the 
total energy change can only be determined by rigorous calculation. 
(Note that (8.58) should be complemented by the electrostatic ener- 
gy; this is achieved by the substitution defined in (8.57).) 
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The value of the internal energy may be used for calculating the 
free energy. By minimizing this energy it is possible, in principle, 
to find the equilibrium concentration of defects. 

In the above discussion, the Ag coefficients were assumed to have 
been calculated either for the case of isotropic elastic continuum 
or in the Born-Begbie approximation. Pseudopotential theory 
offers new possibilities by allowing the calculation of interatomic 
potentials at any distance from a defect. 

Using the formalism developed in [418, 488] the a-component of 
the displacement of a host ion may be written as 


a (ty) = — i ai \ a5 fab 


x {J (aet gag) F (at+en)pe My Fw. (8.59) 





Here t, is the position of the ion in question, T, is the position of 
a defect, the integration is over the Brillouin zone and 


3 
č (q, 2 4) eg (q, A) 
gas (9) = D (q)ans= > ee (8.60) 
n=l TÀ 
is the Fourier transform of the lattice Green’s function [489], where D 


is the dynamics matrix, M the mass of a host atom and e (q, A) the 
polarization vector with a polarization index å. The quantity 


F (ty) we k F (k) ev- Pw) (8.61) 


is the Fourier component of the force exerted by the impurity atom 
upon the host atom located at the site t,. 

The above expressions may be derived using the harmonic approx- 
imation for the total energy of the matrix-impurity system, 


U =U (Q) + + va? F (ty) uy + oy ` Dyav'a'Uavta’y’, (8.62) 
vava’ 


where U (Q,) is the energy of the crystal with periodically regular 
atoms and 


Drava =r J; D(qye Ms“? (8.63) 
q 


is the site representation for the dynamics matrix of the pure crystal. 
We have seen, on the other hand, that the total energy of an 
alloy may be expanded as 


U Ue, + UQ) + UM (Q) HUDHUD... (8.64) 
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Here again the terms U.s, U® and U() are structure-dependent. 
Their form was discussed above. For example, 


Q ' 1— ar 
Ue) = SS @ 1 f(g) PASE , (8.65) 
q 
where 
1 iqr, , | iT, 
F) = -y Wm (a) D e- AN (0) (8.66) 


and AW (q) = Wa (q) — Wy (q). By setting r, = t, + u,, expand- 
ing (8.66) in powers of u, and substituting in (8.62), we find .¥(t,) 
and Dyav'a’. In second-order perturbation theory [490], we have 


FO (k) = — W m (k) AW (k) 2% -$ [1 — 67 (k)]. (8.67) 


In principle, third-order corrections can also be obtained. Using 
the above relations Solt and Zhernov [491] calculated atomic displace- 
ments for dissolved atoms and, in an asymptotic approximation, 
for diluted alkali alloys. The displacements were found to be very 
anisotropic and the asymptotic approximation was found to be 
valid from the eighth or tenth coordination sphere. 

Thus far we have been discussing the pseudopotential approach to 
static substitutional defects. The distorting effect of interstitial 
atoms may be considered in the same spirit [492]. 

In this case, the band structure energy is calculated using (6.1) 
and (6.2) with the crystal pseudopotential in the form 


D ora (r) = > [W m (x —t,— uy) + cyWa (r— ty— @)I, (8.68) 


where the vector æ characterizes the position of the sites of the 
z-lattice of the interstitials relative to the matrix sites. In the 
Fourier representation we then find, to the first order in atomic dis- 
placement, 


W (9) = Wm (4) [ S (a) — i È Ua; (aes (Q)S (4+ Q] 
+ Wa (a) 2 CoS (q+ Q), (8.69) 


where the Q summation is over the Brillouin zone and the j summa- 
tion over the polarization directions, as determined by the vectors 
e; (Q). 

Substituting (8.69) into (6.1) yields the band structure energy, in 
which the electron scattering term (arising from impurity and de- 
formation potentials) is given by 


Unsa= 2 Pys. a (4) S (q + %){—i (qUa) cze-ia%+c.c.}, (8.70) 
q, 1% 
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where 
Pys, a = WaW my (9) € (9). 


Transforming the complex-conjugate term, we obtain 


Unsa = —2i È Orsa (lEn +Q I) (Bn +Q)Uo) Cae Mr, 
(8.71) 


where we have taken advantage of the fact that the structure factor 
is only nonzero if its argument is a reciprocal lattice vector. The 
electrostatice contribution can be found from (8.74) by replacing 
Dys, a by 202 m2 3/Q7° where Zm and Zą are the valencies of, respec- 
tively, the host and impurity atoms. The renormalization of the 
screened impurity formfactor, 

Wa(a) > Wa (0) = Wa (q) HR Sa (8.72) 
secures the low-q convergence of the Fourier transform of the total 
effective interatomic potential. The deformation effects of impurity 
atoms will then be described by the term 


Ua= —2i 2) Da gn + QI) (En +Q Uo) ) Cae nt, (8.73) 


where 
Da = Des, a + Phs, a (Wa > Wa). 


Comparing this with the corresponding phenomenological result [401] 
yields a relation between the quasi-elastic force Fo and the micro- 
scopic parameters of the system, 


Fa=—td (En +Q) Pall srt Qe (8.74) 


The effective deformation potential Vg is then given (omitting 
a constant factor) by 


3 

— | e; (Q) FQ |? 

Vo= — a no (8.75) 
j=1 
where m is the mass of the atom and a; (Q) the phonon frequencies 
of the host matrix. 

Belen’kii [492] used (8.74) and (8.75) to calculate the quasi-elastic 
force and effective interaction potential for the case of carbon and 
phosphorus interstitials in a-iron. The formfactors for Fe, G and P 
were taken from [343] and the exchange-correlation effects were 
treated in the Hubbard-Sham approximation. 

The calculated quasi-elastic force was more anisotropic than 
that observed (the predicted z-component was much larger 
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than the z- and y-component). This discrepancy might be due to the 
strong electrostatic repulsion which exists between impurity atoms 
and their nearest host neighbors on the z-axis (notethata = (0, 0, 
a/2)). The other possible reasons are the formation of impurity- 
matrix covalent bonds or underestimation of the screening of impu- 
rity atoms. To include these effects, however, requires higher-order 
perturbation theory. 

It was found that phosphorus Vg curves were always below those 
for carbon. The Vo behavior in the vicinity of the (0 1/2 1/2) super- 
structural site reveals that the phosphorus solution has a somewhat 
stronger tendency to form a metastable ordered phase. A similarity 
between the pseudopotentials of phosphorus and nitrogen enabled 
Belen’kii to consider his result as consistent with experiment. 

Belen’kii concluded that even though his pseudopotential calcu- 
lations were too simplistic to be quantitatively reliable. they were 
quite useful for qualitative estimates. 

To sum up, pseudopotential theory is now being extensively ap- 
plied in defect-related problems. These include the role of defects in 
the formation of real crystal structure, the formation energy of 
defects, and the stability of defects and their influence on the physi- 
cal properties of solids. Note that the types of defect treatable by 
pseudopotential theory are by no means limited to those considered 
in this book. Other results obtained by the pseudopotential method 
may be found in [17, 73, 493-499]. 

At present, the pseudopotential theory of real crystals may be con- 
sidered one of the more important fields of solid-state theory. 


Chapter 9 


Principles of pseudopotential calculations 
of the properties of metals 


9.4. General 


Basically, there are three types of physical properties which can 
be calculated by the pseudopotential method. The first type demands 
the value of the total crystal energy or some of its components, the 
stability of a crystal lattice being the best known example. Calcu- 
lations of this type have been the subject of many papers and are 
reviewed in [500-503] which makes it unnecessary to discuss them 
here. 

The second type involves the properties associated with the way 
a solid responds to external influences (including heat). These prop- 
erties are therefore determined by coordinate derivatives of the 
total energy rather than by its magnitude. The phonon spectra 
discussed in the previous chapter are one example of this kindj of 
property. In this chapter, we will discuss elasticity, Griineisen’s 
constants, the equations of state, etc. ] 

The third type of properties involves electron transport effects 
and are determined by the matrix elements for the electron transi- 
tions between energy states. We will consider disordered systems 
such as liquid and amorphous metals and alloys, and disordered 
crystalline alloys to illustrate the main idea underlying the calcu- 
lation of these properties. We have emphasized the principles and 
the citations and information are neither exhaustive nor complete. 


9.2. Calculation of the atomic properties 
of crystalline metals and alloys 


The physical properties of both perfect and imperfect, crystals 
may be associated either with the total energy of the crystal or 
with the matrix elements for the electron transitions. In the former 
case we may deal with cither the energy itself or with its coordinate 
or deformation derivatives. When dealing with matrix elements the 
square of the modulus of the matrix element is often involved. In 
what may be considered a canonical way of using pseudopotentials, 
the first step in calculating a physical property is to write an expres- 
sion for it in terms of the total energy, the total energy derivatives, 

and the matrix elements. A calculation is then carried out by sub 
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stituting those quantities found by the pseudopotential method for 
the specific problem. It is this method that will mainly be followed 
in this chapter. 

Pseudopotential calculations of the elastic properties of crystals 
have been performed in quite a number of papers (see [500-512, 276, 
482]). Elastic properties are determined by the derivatives of the 
total crystal energy with respect to the parameters of deformation. 
The parameters are defined by 


Ox, Ox’, 
m=z (2 n Ti —ô;;) 1 i, 1,2, 3, (9.1) 








Ox; Ox; 


where i and j indices are usually replaced by J indices according to 
Focht’s rule (11— 1, 22— 2, 33— 3, 23— 4, 31 — 5, and 12 — 6). 
The second-order elasticity coefficients in this notation become 
1 OU 

TIO ( ans nj |a" (9.2) 
In order to describe a transition from a nondeformed state to a de- 
formed one, it is customary to employ the deformation parameters £;, 
y;, and v defined by the relations 


x’ =x (1+8) x =£ Yy, z! = vtr; 
y' =y (1+ 8)“, y =y, y' =v*'/3y; 
z' =Z, 2’ =Z, z = vuz; (9.3) 


which relate the coordinates of particles in a nondeformed crystal to 
those in a deformed one. The parameters y;, and &;; may be 
defined in a similar fashion. In terms of these parameters the second- 
order elasticity coefficients take the form 


aU total 1 

a = 5 (Cu + 2c) = B, 

OU total 

ay = Cys (9.4) 
1 OU total 1 
ar a = = 5 (Cy — t) =c". 


The first of these coefficients is known as the bulk modulus and 
the others are shear moduli. 

The total energy of a crystal which has a volume V and is made up 
of N ions of valency Z (so that the free-eleċtron-gas density is ny = 
zN/V = 3/(4mnr)® and the volume of the Wigner-Seitz cell is 
VIN = 4nR3/3 = Q) is 


Utotal = Uo + Uy + Uns + as: (9.0) 


The first term on the right is the energy of a free electron gas, which 
is the sum of the kinetic, exchange, and correlation energies. Its va- 


N 
Loe) 
ites) 
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value per atom is 


U, =Z (= — 2! _ 0.115 40.031 Inv). (9.6) 


rê rs 








The second, third, and fourth terms are, respectively, the average 
energy of electron-ion interaction 
Z 


. 8; 4 
U,= lim or [ = Wer (a) | = Bro, (9.7) 





the band structure energy 
Uns =Y S0 WA Px D=L Oo, (98) 
q 


and the electrostatic energy 
Ues =- ~aZ?/2R,. (9.9) 


œin (9.9) is Madelung’s constant, and e*(q) in (9.8) is the dielectric 
permittivity containing corrections for exchange and correlation. 
The function s* (q), the dielectric susceptibility y (q), and the func- 
tion f (q), which describes the exchange and correlation corrections, 
are related by (see also Secs. 3.2, 6.2, and 6.3) 


e* (9) =1— Ga [1—4 (91x (0). (9.10) 


Since U, and U, only depend on volume and do not depend di- 
rectly on structure, they are often combined together to give what is 
called the volume-dependent contribution to the total energy. A third- 
order perturbation theory correction U, may also be included when 
calculating physical properties [506, 508, 509]. 

The average energy of electron-ion interaction U, is defined differ- 
ently by different workers. Apart from directly calculating the 
long-wavelength limit of B, this quantity may be found [507, 5114] 
by setting the derivative 


OC total — 
Ors 


equal to zero in the absence of external pressure. To find the elasticity 
coefficients, the derivatives of the total energy with respect to 
deformation parameters must be found. The expressions for some of 
these derivatives are reproduced below from the paper by Suzuki 
et al. [504]. The first and second derivatives of the characteristic 
band structure function D),(y, rs) with respect to v are of the form 


bP (N, rs) _ 1. 2D (n, rs) 





—P (9.11) 


ðv 3 5 ôr ? 
s (9.12) 
a0 (N, rs) 1 2 dO (N, rs) 2r dD (N, rs) 
ðv? ~g s ôr? 9 s ars , 


where q = q/2kp. 
19—01063 
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The derivatives with respect to the shear parameters y; and €; 
have the form 


aD (yn, rs) — ?D (N, Fz) ( an j 1, ôP (n, rs) ôn 

Ove yı an avi 
In order to obtain the derivatives of ņ with respect to y; and e; it 
should be taken into account that deformation changes the transla- 
tional vectors of both the direct and reciprocal lattices. For the BCC 
lattice we have 





(9.13) 





on — -— (5V7 Mmo 

ya 6 ; (mê --m3-L m3)1/2 ? 

ay a \ 1/3 iom 
an _ (41 aa S a Or) 
ay NG e mT nem E 
an m \1/3 7 mim? i 3m? -- m? ` 


where the integers m,, m, and m, are the components of reciprocal 
lattice sites. 

Using (9.14) the second derivatives of the characteristic func- 
tion with respect to yı and e, are found to be 


Pens rs) Pom red (2)! mim, (ny rs) (z) 
6 6 


av} an? mimir mz ` ôn 
mm m? 
<{ Game t+ acme j’ 
(9.15) 
ôP (yn, rs) O° (N, rs) (+) 2/3 (mj—m})* 
ðe? — an? 6 m? + m2-- m2 
a DN. rs) (= 1/3 f (m? — m3)? | 3m? + m3 
rT T) O (mmg m (me met mg)? Ie 


The derivatives of other components of the total energy were also 
obtained in [504] but the corresponding expressions are too cumber- 
some to reproduce here. 

From these results, the elasticity coefficients for a number of 
the alkali and other metals have been calculated. In [505, 506] the 
coefficients c11, Cy, and c,, were calculated for Al and Na, and from 
them the parameters B, c’ and c,, were found. Benckert used an 
Ashcroft model potential (with a damping factor) whose param- 
eters were fitted to experimental Fermi-surface data. 

Table 9.1 shows the (computed) contributions to the total energy 
(U,is the third-order correction) and to the elasticity coefficients. 
The sums of the contributions are also shown and can be compared 
with the corresponding experimental results. 
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Although the computed and experimental values can be seen to 
agree fairly well, it turns out [506] that the agreement depends sig- 
nificantly on the choice of potential, the third-order contributions 
being the most sensitive ones. 

Shimada [507] employed a three-parameter model pseudopotential 
having a continuous logarithmic derivative. The parameters of the 
pseudopotential were fitted to equilibrium conditions. It can be 
seen from Table 9.2 that the values of B andc’ computed in [507] 
agree well with the experimental data. There are at least three 
general conclusions to be drawn from the above results. 


Table 9.2, Elasticity constants c’ and B for Li. Na, K. Rb. Cs, Al, Pb 
(theory [507] and experiment) 


ce’ (1011 dyn/cm?) B (41011 dyn/cm?) 
Element 
theoretical experimental theoretical experimental 
Li 0.417 0.117 1.335 1.365 
Na 0.0761 0.075 0.737 0.775 
K 0.0376 0.0377 0.363 0.366 
Rb 0.0279 0.0287 0.281 0.291 
Cs 0.0218 0.0205 0.216 0.219 
Al 1.306 2.619 7.944 7.938 
Pb 0.647 0.506 4.887 4.879 


The experimental values are obtained for each element by averaging the data from 
Table 3 of [507]. 


First, it may be argued that second-order elasticity constants can 
be calculated rather accurately for pure metals by fitting pseudopo- 
tential parameters both from atomic properties (equilibrium condi- 
tions) and from electronic characteristics (the shape of the Fermi 
surface, etc.). The pseudopotentials thus obtained may then be used 
to calculate the other parameters of pure metals. Second, these 
results may provide a basis for calculating the elastic properties of 
alloys. And finally, comparing computed and experimental elastic 
parameters may help bring out the factors which determine the 
elastic properties of crystals. 

These conclusions may be said to have been confirmed. In their 
work on pure metals, Khanna and Sharma [512] fitted the model 
pseudopotential of [513] to Fermi surface data in order to calculate 
the cohesion energy, compressibility, electrical resistivity due to 
single vacancies, third-order elasticity constants, and the supercon- 
ductivity transition temperature. Some of their results are given 
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in Table 9.3, which actually consists of two parts. In columns í 
through 7 the computed values are compared with the experimental 
results and it can be seen that pseudopotentials fitted for one prop- 
erty may indeed be useful for determining another. For the theoret- 
ical values given in the last two columns, no experimental data 
are as yet available, but judging from the good results in the pre- 
vious columns, the predictions can be expected to be realistic. 


Table 9.3. Pseudopotential results and experimental values for some physical 
properties [512] - 





Cohesion energy Compressibility TK 
(Ry/el) (K/K0) c 
: , , Gt o (rrr (4 O44 
Element 3 Le = LE 7 LS (H@/at.%) dyn/cem2} 
2 22 5 35 z 2z 
Li —0.550 —0.551 1.65 2.0 0.014 0.08 0.49 — 
Na —0.470 —0.460 1.30 1.5 0.0071 — 1.01 —6.82 
K —0.396 —1.390 1.08 1.0 0.0071 — 1.58 —4.99 
Rb —0.371 —0.366 0.90 0.82 — — — — 
Al — — — — 2.0 1.1496 0.57 — 
Pb — — — — 6.5 7.23 0.98 — 





Vaks et al. [483, 510] used the method of adjusting pseudopotential 
parameters to calculate various properties of the alkali metals such 
as elasticity moduli, phonon spectra, equations of state, dielectric 
permittivity, and the effect of third-order perturbation theory correc- 
tions. They thoroughly compared their results with experimental 
data and the reliability of the results was carefully checked against 
the various factors in the calculation. It is pointed out that the 
condition 


AU total __ 
~a, = 9 (9.11a) 


is crucial for a correct description of the static properties of metals 
in that it fixes the constant B and hence determines the quantity U, 
(see Eq. (9.7)), which greatly contributes to all the atomic proper- 
ties as Table 9.1 shows. Another important point is that using their 
fitting procedure the experimental dielectric permittivity was fitted 
best if the exchange-correlation corrections were taken in the form 
proposed in [514, 429]. It was also found that the third-order contri- 
bution may be significantly reduced by redefining the pseudopoten- 
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tial parameters, the important implication being that a large third- 
order contribution may simply be due to a “bad” pseudopotential. 
A comparison of computed results with experiment showed that 
the local pseudopotential of [483, 510] is not only useful for Na but 
also for K and Rb, even though the d-band in these two metals is 
rather close to the Fermi level. For Li and Cs, however, the computed 
values of the cohesion energy and the derivatives of the elasticity 
moduli disagreed significantly with experimental results, although 
the phonon spectra, equations of state, and the moduli themselves 
were reproduced correctly. Pseudopotential calculations may thus 
be very helpful in predicting physical properties although compari- 
son with experiment is always needed to determine the reliability of 
results. It should also be kept in mind that the accuracy of the 
results may vary from element to element if for no other reason than 
because the core states contribute differently to different elements of 
the same group. The optimization schemes may also differ. In [510] 
the volume dependences of some properties of alkali metals and the 
equations of state for a wide range of pressures were calculated. It 
is shown that in the well-studied pressure range of p < 50 kbar, the 
results for Li, Na, and K are in good agreement with experiment 
and the uniform compression data on Na and K were even reproduced 
for pressures as high as 400 kbar. It should be expected therefore that 
pseudopotential calculations may be helpful at high pressures, design 
pressure gauges being a possible application. 

There are other approaches to pseudopotential calculations of 
elastic characteristics. Brovman and Kagan [508, 509, 276] used the 
relations which exist between the elasticity properties (the moduli 
and sound velocities) and the dynamic matrix responsible for the 
phonon spectrum of the crystal. In cubic crystals, for example, the 
propagation velocities for the longitudinal and transversal waves 
in the [400] symmetry direction are, respectively, 


Vy = (€y,/p)/? and Uqi00) = (€x,/0)*/?, (9.16) 


where p is the density of the crystal. 
The shear modulus c’ = (1/2) (c,, — c,2) is directly related to the 
propagation velocity of the transversal wave in the [100] direction: 


V~_110]= [hci — C12)/20] 1/7 (9.17) 


(atoms are displaced along [110]). 

Since the velocities are, in turn, the frequency-to-wave vector 
ratios of the corresponding vibrations, clearly a pseudopotential cal- 
culation of phonon frequencies makes it possible to find elasticity 
moduli. We refer the reader to [276] for more detail and only repro- 
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duce here the results, which are 
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It is worthwhile discussing the relationship between the dynamic 
moduli as defined from the sound velocity and the static moduli. It 
was shown in [509] that the inclusion of multi-ion interaction 
should make these moduli equal. That this is indeed the case was con- 
firmed by a comparison of results obtained by both methods. 

Thus far we have been concerned with the “ground-state” proper- 
ties of a crystal. The dependence of these on external parameters 
such as temperature and pressure may also be determined by the 
pseudopotential method. Portnoi et al. [501], for example, calculated 
Griineisen’s constant y, which characterizes the changes in phonon 
frequencies v; caused by variations in volume V 


dln Vi 


Apart from the constant itself (in fact, the constant’s average over 
the frequencies is usually computed), its frequency dependence has 
been calculated [515-517]. In order to determine the (average) value 
of Griineisen’s parameter, it is convenient to write 


4 dlnv? 
Y= F hny ’ (9.20) 


where v? is the second moment of the frequency spectrum. Since v? 
is related [518] to the coordinate derivative of the interatomic poten- 
tial V, we find for the pairwise central interaction 


Daf- i ymt mR] 


1 i 
de [4 nmw] 


(9.21) 
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where C; is the coordination number for the ith coordination sphere 
of radius R;, and Vp, Vp, and Vp’ are the derivatives of the inter- 
atomic potential. 

In Table 9.4 the y as computed in [519] are given with their ex- 
perimental values. 


Table 9.4. The values of y for the alkali metals 


G | Na | K | Rb | Cs 


theoretical 1.015 1.024 1.102 1.086 
experimental 0.94 1.159 1.28 1.28 


In view of the strong frequency dependence of Grüneisen para- 
meters [517] the agreement between theory and experiment is rea- 
sonably good. 

A knowledge of y enables Senoo et al. [520] to relate pressure to 
volume by using the Mee-Grüneisen equation in the Debye approxi- 
mation [524] 





AN; 
; E hes hp 
p=— (“apt ) + 078" DS (ar) Ty (9.22) 
' 1—e BT 


and solving (9.11) for the absolute zero temperature. A Heine-Abaren- 
kov model potential was chosen from the condition that the equilib- 
rium values of lattice constant as found for (absolute) zero tempera- 
ture and zero pressure should correspond to the experimental values. 
The volume-pressure dependence in the range from zero to 100 kbar 
was found rather accurately for Al, Si, Ge, and four alkali metals 
(Na, K, Rb and Cs). Calcium alone gave poor agreement. 

It was pointed out in Chapter 8 that the pseudopotential method 
is applicable to real crystals. Soma [522] employed the method to 
estimate the energy of vacancy formation and to demonstrate the 
(rather substantial) dependence of this energy on relaxation [522, 523). 
Du Charme and Weaire [497] used the principles discussed by Harri- 
son [73] to calculate the self-diffusion activation energy. They found 
that the activation energy isthesum of the vacancy-formation ener- 
gy [73] plus the energy of vacancy migration. This latter may be 
found when calculating Ues and Ups by taking into account that 
diffusing atoms migrate via saddle points located between lattice 
sites. The computed self-diffusion activation energy for Al was close 
to the measured value but it was very overestimated for Li, Na, 
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and K, implying that the migration of single vacancies is the mecha- 
nism for self-diffusion in Al alone and that in the alkali metals they 
studied it is not. The same scheme may be used for calculating the 
energy of formation of vacancy pores produced, for example, in ir- 
radiated metals [498]. 


9.3. Transport properties of noncrystalline metals 
and alloys 


An important application of pseudopotentials is the calculating 
of the transport properties of disordered materials such as liquid 
or amorphous metals and alloys. The topic was first considered in 
[524-527] in which the expressions for electrical resistivity, thermo- 
emf, etc. were derived using pseudopotential formfactors for both 
liquid [524-526] and amorphous [527] metals. In the more general 
approach developed in [528] for liquid metals and extended in [529] 
for amorphous metals, 7-matrix formfactors were used. This approach 
is particularly well suited for transition metals because it poses 
no problems in constructing a pseudopotential when d electrons are 
present, which belong neither to core nor to outer shells. Both meth- 
ods are widely in use and will be discussed below. 

Basically, there are two approaches to the transport properties of 
disordered materials, one based on the nearly-free-electron picture 
and the other on the tight-binding approximation. We will only be 
concerned with the first approach. The second approach usually 
involves either the average 7-matrix approximation (ATA) or the 
coherent poteritial approximation (CPA) and is far beyond the con- 
fines of pseudopotential theory. The reader is referred to [530-533] 
for more details. 

The physical properties of noncrystalline materials are sometimes 
studied in parallel with those of disordered crystal alloys [534]. 
A feature of such alloys is their compositional (chemical) short- 
range order [412], as opposed to the topological short-range order 
specific of noncrystalline systems. Note, though, that compositional 
short-range order is also possible in noncrystalline alloys [535, 536]. 

The method discussed here assumes the model of a gas of conduc- 
tion electrons which interact with and are scattered by irregularly 
placed metal ions. As an external electric field drives the electron 
through the disordered medium, the scattering determines the 
electrical resistance, which can be calculated using perturbation 
theory or more general methods. This is in fact an extension of the 
Drude-Lorentz-Sommerfeld model, for which conductivity in the 
relaxation-time approximation is given by [537] 


o= + e2vbtN (Ex). (9.23) 
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Here e is the electron charge, vp is the velocity of electrons at the 
Fermi level, N (Ep) the density of electronic states on the Fermi 
surface, and the relaxation time Tt is defined by 


ci \ (1— cos 8) Q (9) dQ, (9.24) 


where @ is the scattering angle, Q is the solid angle and Q (®) is the 
probability for scattering through the angle 0 determined by 


Q (0) =F (kU |’) |? N (Lp) /4n, (9.25) 


where (k | U |k’) is the matrix element for the transition of an 
electron from a state k’ to a state k caused by the potential U (r). 
The factor 1/2 indicates that the electron spin is not affected by 
scattering. The potential U (r) is the superposition of ionic poten- 
tials, 


U (r) => U(r—r;), (9.26) 


where the r; indicate the positions of the (irregularly arranged) ions. 
In the local approximation, the matrix element (k | U | k’) may 
be considered as the Fourier transform of the potential U (r) 


(k|U|k’y = l U (r) exp [i (k—k')r]dr=U (q) X, eè, (9.27) 


1 


where a normalization to volume is assumed and q = k — k’ is the 
scattering vector. 

We invoke the diffraction model of neutron and X-ray scattering 
[485] to write 


[ik |U 1k’) 2 = |U (q) 2a (a), (9.28) 


where 





1 igr; |2 
TELIRIS oa 
is the structure factor (or interference function) and can be directly 
estimated from X-ray or neutron scattering data. L is the number of 
ions. Substituting (9.24), (9.25), and (9.28) into (9.23) yields 

1 
te 

T ) [U (0 a(g) nan. (9.30) 
0 
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pı = he? 
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This shows that the electrical resistivity of liquid metals and of 
their analogs is determined by the square of the modulus of the 
pseudopotential formfactor and by the structure factor of the media 
(this is, of course, for simple metals, to which pseudopotential 
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theory is applicable). Equation (9.30) is helpful both in evaluating 
electrical resistivity and in determining its behavior under various 
conditions. For example, the temperature coefficient of resistivity 
for constant volume [525] is 


tPA eH, ea 


where @, is the thermal expansion coefficient, and it can be seen 
that a qualitative argument about the upper integration limit and 
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Fig. 9.4. Structure factors of liquid metals for different temperatures (7, < 
Ta) as functions of the scattering vector q (q; = 2kp for a metal of valency 
Z;). From Khar’kov et al. [538]. 


the integrand in (9.30) might be sufficient for predicting the sign of 
the effect. Figure 9.1, which is taken from [538], demonstrates this 
for liquid metals of various valencies. The figure shows the approxi- 
mate behavior of the structure factor for two temperatures (Ti < T2) 
and of the product | U (q) |? a (q) for T,. The position of the upper 
integration limit in (9.30) (q = 2k,) is also shown. It will be seen 
that for monovalent metals the upper limit is on the left wing of the 
structural maximum, while for bivalent and polyvalent metals it 
is on the right. 

The q-dependence of the product | U (q) |? a (q) turns out to be 
nonmonotonic and can be conveniently described in terms of the 
parameter qo, the value of g which makes | U (q) |? zero (see al- 
so [73]). The region q < q corresponds to what is known as plasma 


electrical resistivity of for which scattering by relatively large 
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density nonuniformities is relevant; the region q > qa corresponds 


to the structural resistance of". The total resistivity is the sum 


of both. 
Under the simplifying assumptions 


Uig=W@, (W@))> +|WOR=+(FEr)» 
a(q)~a(O), Q/2kp ~ 0.8 = no, 
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Since according to Ziman [167], 
a (0) = kpT/(.Ms?), (9.33) 
where M is the ionic mass and s the velocity of sound, it follows that 


of! increases linearly with 7 and is inversely proportional to the 
velocity of sound (which is in turn temperature dependent). This 
behavior is characteristic of monovalent liquid metals, in which p?? 
is the main contribution to the electrical resistivity. 

Turning to structural resistivity, we may write 





1 
12x 1 4 
o= ia ap T | W Oal dn. (9.34) 


No 


It can be seen from this that the temperature dependence of pi" 


(and hence ptotal) for constant volume comes through a (q). A gene- 
ral argument about the structure of disordered materials suggests 
that disorder should increase with temperature. This means that 
the maxima of the structure factor will be smeared out so that a (q) 
will decrease in the peak and increase between the maxima. Simple 
qualitative considerations like this (see Fig. 9.1) allow us to predict 
that the derivative (0/07)pst™ must be negative or slightly positive 
for bivalent metals and very close to zero for polyvalent metals. 

We will now extend the above argument. to other transport prop- 
erties and to the transition metals, postponing for a while a quan- 
titative discussion of the structure factor. 

We first consider thermo-emf. If we assume that the Fermi surface 
of conduction electrons is spherical, it is well known [538, 524] 
that the thermo-emf Q and electrical resistivity are related by 


_ wkpl ð lno (£) 
Q= — laine dene, (9.39) 
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This expression holds for any metal or alloy, whether crystalline 
or not. It can be shown that the dimensionless parameter 


A ô In p (E) 
t= —| ôln E Jeze, (9.36) 


determines the dependence of thermo-emf on microscopic character- 
istics of the metal so that it is worthwhile investigating this para- 
meter within the nearly-free-electron model. After introducing the 
cross section for the scattering of an electron by an ion, 


1 











0: = a \ a (q) |W (q)? 18 d (9.37) 
0 
we write 
p= oy. (9.38) 
Since vp = (dikp)/m =(h/m) V (802ZN)/V, we have 
ve/Z œ 1/(VEF) (9.39) 
giving 
p (E) = 4 2P: (9.40) 
Thus 
_f dome ô lno; (E) _ 
E Racy ~ OnE ee, 717" (9.41) 
where 
— ô ln Oi (E) 
r= eer ee, (9.42) 


If o; is independent of energy, then r = 0 and & = 1, so that Q = 
x’kgT/3eEp, and the sign of the thermoelectric effect is deter- 
mined by the sign of the electric charge, as it normally should in 
a metal. For r = 1 we have Ẹ = O and no thermo-emf. For r > 1, 
the thermo-emf has an unusual sign. Ziman and his co-workers [525] 
expressed r in terms of the structure factor and pseudopotential 
parameters. According to (9.37), o; = o; (kr), and for the free- 
electron model we have 


r= —2+2g, and §=3—2g), (9.43) 
where 
_ a(2kp) |W (2kF)|* 
80 = aa) IW (a1?) (9.44) 
and 





1 
(a (IW (a) 1) = zgr | (9) IW (a) m8 dn. (9.45) 
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It can easily be seen that g is non-negative because so are all the 
quantities which make it up. This parameter is only zero if W (2kp)= 
0, i.e., 2kķkr = go. For local pseudopotentials g, œ 0.8 x 2k, and 
so usually gy >>0O and § <3(€ =3 when W (2k-) = 0). If 
a (2kp) | W (2kp) E = (a| WFP), then g > 1.5, ro = 0, € = 1, 
and the thermo-emf has a normal sign, as already mentioned. The 
similar situation occurs if 0 <g< 1.5. For gy >> 1.5 the quantity & 
is negative, making the sign of thermo-emf unusual. This is possible 
if the product a (2k) | W (2k) |? becomes very large near q = 2kp, 
which is usually the case when 2k, is close to a maximum of the 
structure factor a (q). The quantity go is at its largest if a maximum 
of | W (q) |? also corresponds to the condition q œ 2kp. As the pro- 
duct a (q) | W (q) F also enters the expression for electrical resis- 
tivity, the resistivity should be high if the thermo-emf is negative. 
A qualitative argument has thus led us to an important relation 
between thermo-emf and resistivity. 

The transition metals may be treated in a similar fashion, although 
instead of pseudopotential formfactors, the ¢t-matrix formfactors 
of muffin-tin potentials are better [528] (see also Sec. 2.5). We have 


Q (0) ~ |k [t] k)? N (EF), (9.46) 

where the ¢ operator is defined by 
toy = Uy, (9.47) 
where g, and fk are the unperturbed and perturbed functions, res- 


pectively. The scattering amplitude is expressed in terms of the 
formfactor, 


t(k, k’)= —2 k |t] k’), (9.48) 


and the formfactors ż (k, k’) form the ¢-matrix. The ¢ operator obeys 
the integral equation 


iPr =U (Pk + GU yx) = UE, + UG toy, (9.47a) 


where Go is the free-particle propagator and U is the true scattering 
potential for the material under study. 

The single-scatterer t-matrix formfactors for energy-conserving 
transitions are [528] 


2m3 { . 
= im EZ (sr) 2 (21+ 1) sin n (EF) 


x exp [in, (Ey)] Pı (cost), (9.49) 


t (k, k’)= 


where J is the orbital quantum number, 1; is the phase shift as de- 
rived from the Fermi energy, and P;, a Legendre polynomial. Substi- 


Ch. 9. Pseudopotential calculations of metals properties 303 


tuting this into (9.23) and changing from o to p we find 


1 
p= ee \ Avpa (q) |t (k, k’) dn. (9.50) 


e? h?y 2 
F 0 


A technique for calculating resistivity by (9.50) is outlined by 
Dreirach et al. [539]. Their results show that the d-components dom- 
inate in the transition metals. For nickel, for example, the phase 
shifts No, 1 and ù are —0.25, 0.02, and 2.892, respectively; for Fe, 
the figures are —0.479, —0.071, and 2.520. We may then write to 
a good approximation [528] 


307343 . o 

~ ae I} 
p= Me kRE p Sin” Ye (Er) a (2hp). (9.51) 
The energy dependence of na is resonant in nature, and if E and T 
denote the position and width of the resonance respectively, we have 


l tan n (E) =T/(Eọ— E). (9.52) 
In this approximation 


3073 K3 Yr? 

P= nea, e Ep oA) FEE Seppe (9.53) 
The resistivity and thermo-emf calculated by these formulae strong- 
ly depend on the relative positions of Ep and Ep, and if the calcu- 
lation parameters are appropriately chosen, a reasonably good fit to 
the experimental data is obtained. For nickel, for example, the 
experimental values for p and Q are 85 pohm-cm and —36 pV/erad, 
respectively. The computed values are 227 and —38 for kp = 
0.851 a.u., while they are 106 and —33 for kp = 0.930. 

Adjustable parameters are also used to calculate the transport prop- 
erties of nontransition metals. For example, the resistivity of Na 
computed from an adjusted Ashcroft potential [538] was found to 
be 9 pohm-cm, which is rather close to the experimental value of 
9.6 pwohm-cm. Results for the other metals are reviewed in [538, 
165, 541]. It should be noted, however, that the good agreement with 
experiment is only evidence for the promise of pseudopotential theo- 
ry rather than an indication of its completeness. 

We have already mentioned that reliable information about the 
structure factors is crucial for fitting pseudopotential calculations to 
experimental data, so the temperature and composition dependence 
of the structure factor are definitely worth discussing for disordered 
materials. 

The relationship between electrical resistivity and the structure 
of disordered binary alloys is discussed in much detail by Bhatia and 
Thornton [542] who obtained the important result that the alloy 
structure factor may be represented as a sum of two temperature- 
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dependent terms. The first terin is associated with topological short- 
range order and describes the arrangement of the atoms of an arbitrary 
species in noncrystalline materials, which may in principle contain 
any number of components. The second term depends on the compo- 
sitional short-range order and describes the arrangement of the 
different species of atoms against the background of the overall 
arrangement of atoms. 

We will first consider the temperature dependence of the structure 
factor using, as Bhatia and Thornton [542] did, an analogy with 
neutron and X-ray scattering and employing van Hove’s idea 
[543] of a dynamic struclure factor. The structure factor may be 
written, in general, as the sum 


S@M=S@+ti@Q+tS%@+--5 (9.54) 


where the subscript indicates the number of phonons involved in 
the scattering process. When there are none, the scattering is elastic 
and 


So (q) =a (q) e724), (9.55) 
where 


a(q)=— >) ert? (9.56) 
i F 

is the interferential function, which is the scattering intensity 
summed over the average positions of ions; the Debye-Waller tempera- 
ture factor e~*(a) describes the reducing effect of thermal motion 
on the main maximum. The higher terms in (9.54) correspond to one-, 
two- or multiphonon thermal diffusion scattering. In the simplest 
case of independent atomic displacements, thermal diffusion scat- 
tering is proportional to (1 — e°). 

The first few terms of (9.54) were evaluated in [534, 542, 544] in 
the Debye approximation. The dynamic structure factor S (q, o) 
may be represented as 





S (q, ©) = So (q, o) + Si (q, ©) +.. (9.57) 
where 
So(q, ©) = a (q) e726 (0), (9.58) 
27 — hq? l 
Sil, o) = 072 E S AE fa (q+ e) ê (ot o) 
Ç 


+a(q— p) ô(0— o)l, (9.59) 
alq +o) =- > D expli(a +e) (r;— ry) p=(p, i) (9-56a) 
jf’ 


i and @ are the polarization index and the wave vector of an elastic 
wave; x = fhiw/(kgT); and n (œ) = (e — 1)7. 
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The structure factor of X-ray and neutron scattering, 


S*(q)= | S(q, 0) do (9.60) 
and the structure factor of electrical resistance theory, 


S (q)= | S(q, o)an(2) do (9.64) 
are Clearly the averages over the frequencies of the elastic waves [544]. 


In the Debye approximation, it can be easily seen that S% (q) = 
SE (q) because in this case œ ~ 0 according to (9.58), and hence we 


have lim zn (x) -= x (eï — 1) œ~ 1. In the same approxi- 

x->0 
mation, 
x hg? 
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S= — Zo D y lalate) Hala) n(x) n(—2) 
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1 
=a (4) (0/7) | (2) a ($) n(@n(—2) | -alate (9.63) 
0 


PD 





where z = hiw,/kpT = (hog/ky T) (0/Pp ) = (8/T)/(G/@p } Pp is 
the wave vector of the elastic wave which corresponds to the radius 
of the Debye sphere; O is the Debye temperature; and a (q) = 
3 (hq)? (exp (— 2M (q))/.\fkg9). When «a (q) and 60 are specified 
in S* (q) and Se (q), the static structure factors can be evaluated to 
a required accuracy. 
In the high-temperature limit 
1 


S$ (q) = Si(q) (4) j a (ar) \ alate) 


=a (q) AX(q), (9.64) 


where the second integral averages a (q) over the Debye sphere, which 
is circumscribed about the vector q. At high temperatures A* (q) = 
A? (q) ~ 1 for any q. The low-temperature expressions for A (q) 
[544] are too cumbersome to reproduce. It can be seen from the 


20—01063 
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expressions for Sy (q) and Sı (q) that So (q) decreases with T in 
proportion to the Debye-Waller factor e-2M(q@), whereas S, (q) in- 
creases as the product of the temperature and e-2™(q). 

The (multiphonon) static structure factor for electrical resis- 
tivity is [545] 


S° (q) = So (a) + SP (q) + (1 —[14 2M (g) e-24}, (9.65) 


where Se (q) and Se (q) are given above and the last term does not 
exceed 2M? (q) unless the temperature is very high. For 7 < 8 we 
have to a good approximation 


S° (a) = So (a) + Si (q). (9.66) 


In the low-temperature limit S? (q) œ (7/8)?. 

At arbitrary temperatures, the temperature behavior of the static 
structure factorsis discussed [545] in terms of averaged structure 
factors 


fp 


A*(q)= | 9 do [n+] | Gee (late) 
0 
?D 
x {| ede [ne] (9.67) 
and i 
D a 
A°(q)= | de en(a) ina) +4} | Fa (lat ol) 
0 
?D 


x{] degn(z)In(a) +11}. (9.683) 
0 


If multi-phonon effects are ignored, the static structure factors are 


S* (q) = a (q) e-2™ +. 2M (q) A” (q) 
a (g) e-MO + AX(q) {1 — e-o} (9.69) 
and 


S°? (q) —a(q) e~2M(q) 1 (q) (+) ° A® (q) Jo (+) ’ (9.70) 


where 


Ja ( £ )= 


dz z?n (x) [n(x)+ 1] 


a je 


© 


1 
E (+) \a(2) (=y n(x) [n (x)+ 4} (9.74) 
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and 


2M (q)=3{(hg)?/(Mky)] (4-)° | a [n (2)++| dz. (9.72) 


ot ahy] D 


As the integrals J, (8/7) and M (0), are well known in the Debye 
approximation, the quantities So (q), S*¥ (q), and Se (q) may be evalu- 
ated as functions of the temperature and scattering vector, and the 
temperature dependence of resistivity will therefore be determined 
for various g-to-2kp ratios. A detailed calculation of this kind was 
performed by Meisel and Cote [545] who showed that the A* (q) and 
A? (q) curves are Gaussian in nature, both having a maximum at 
a certain value of go. As the temperature increases, the mixima get 
closer together and are virtually indistinguishable at about T ~ 0. 
Meisel and Cote also calculated the Se (7)/Se (6) ratio for various 
go and showed that the temperature behavior of the ratio varies 
substantially depending on whether the go values are near the maxi- 
mum or far from it. In the former case the ratio varies as T? at low 
temperatures, passes through a maximum at T œ (0.3-0.4)0 and 
then decreases. In the latter case there is no decrease with increasing 
temperature. The interpretation of the negative temperature coeffici- 
ents in amorphous, liquid, and disordered crystalline alloys is un- 
doubtedly an important feature of the diffraction model. Note that 
while the model is particularly well suited for amorphous metals, 
other methods may be applied to liquid metals [545] or crystalline 
disordered alloys. It has been shown, for example, [546] that when 
heat-treated, a crystalline disordered alloy may become less resis- 
tive due to the temperature dependence of the parameters of com- 
positional short-range order. 

The pseudopotential theory of resistivity of liquid metals was 
extended by Faber and Ziman [526] to include alloys of the nontran- 
sition metals. For a binary alloy of components «æ and P with atomic 
concentrations Cg, Cg and molar volumes Va, Vg, they found 


4 
123 IT m 
m= gar | On dn, (9.73) 
0 


where V is the averaged molar volume, and 


IW (q)? =N {Tq Tima + > Se, ceU a (9) Un (q) a ) aap, (9.74) 


a=1 p=1 


where W (q) is the matrix element of the alloy pseudopotential, 


Ui(9) = 





1 (i=1, 2), 


20* 
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N is Avogadro's number; W, (q) and Wp (q) the formfactors of the a 
and f ionic pseudopotentials, and agg the partial structure factor 
for electron-wave interference from an atomic pair af. It is as- 
sumed above that the arrangement of unlike atoms is entirely deter- 
mined by their separation and is independent of other properties of 
the elements. In this approximation, 


U?(q) = eU; (4) +203 (q), (9.75) 

U (q) =c,U (4) + U2 (9), (9.76) 

U (q)? = eUY (Q) + 261620, (QU2(q) +6303 (a) (9.77) 
By substituting (9.75) and (9.77) into (9.74) we obtain 











p= ZE [C101 + C202 + C4C3 (o) — 20}, + o, — o, + 2042—6)], (9.78) 


where Z = cZ, + C229, 
1 


Am? U s 9.7 
o= f ai (9) U: (QU; (@ n dn, (9.79) 
p A 
0% => | Ui (0U; a)n dn, (9.80) 


0 
and 


O = 04, O2 =02 O =O], Oz = 03. 
Drawing an analogy with solid solutions, Faber and Ziman intro- 
duced the idea of a perfect liquid alloy defined by 
lii = a23 = A33 = A, V= Va (9.81) 
If we define 


o 12 — 2 FT 7 — 
Qi = Go ’ O12 = Go, ’ 022 = -5 » li = Ay2 = l2 5 a, (9.82) 


the resistivity of such an alloy is 
1 


p= Tee {e104 + 002+ cica (1—0) Sar | (Ws (0—00? dn) , 
' (9.83) 
which may be rewritten as l 
=o +p" (9.84) 
with 
1 _ "WEF C101 + €402 (9.85) 


T e cıZı + BYA , 
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and 


m 


” 4m? i 
"=a azy ae (1-9) \ U, (a) — U: (g)? ndn. (9.86) 


oer 


In polyvalent metals, Faber and Ziman argue that a is close to unity, 
giving 
0 ~ p. (9.87) 


For the transition metals, it has been suggested [528] that better 
results for the resistivity of noncrystalline alloys or disordered crys- 
talline alloys will be obtained by replacing | W (q) |? by an averaged 
T-matrix element, for example, in the form 


(Tan)? = Ca ltal [1 — Ca + Celaa (9)] + ce Itel? [1 — cg + gage (a) 
+ Calg (tatg + tath) lagg (9)— 1l, (9.88) 


where ta is the t matrix of the @ species. 

There is strong evidence that the arrangement of atoms of differ- 
ent kinds is not random both in crystalline [412] and noncrystalline 
[535, 536, 547, 548] alloys, evidently due to differences in interatomic 
interactions even between atoms of equal valencies. This led Bhatia 
and Thornton [542] to introduce new structure factors different 
from dz, and similar to the | S (q) |? and | C (q) |? functions we de- 
fined in Chapter 6. Asin Chapter 6, | S (q) |? represents the structure 
of the “average” crystal (i.e., periodic density fluctuations) and 
| C (q) |? describes the density fluctuations caused by fluctuations in 
composition, while the interferential term S (q) C (q) vanishes in 
a periodic crystal (see Sec. 6.2). The functions introduced by Bhatia 
and Thornton for a noncrystalline medium are Syw (q), Sce (q) 
and Sy¢ (q). The first describes the arrangement of sites in a dis- 
continuous network as a whole, the second describes the arrange- 
ment of atoms of different species on the sites of the network, and 
the third is the interferential term. Clearly, Syw (q) corresponds to 
topological short-range order and Sce (q) to compositional short- 
range order, and we refer the reader to [535, 536] for the explicit 
form of these functions. It is worth noting that partial factors like 
those discussed here are convenient for the calculation of the total 
energy of noncrystalline alloys [550]. Here we limit ourselves to quot- 
ing some of the expressions derived in [539] for resistivity. 

It is shown in [549] that if scattering is weak in an alloy the tran- 
sition of an electron from a state k’ to a state k may be character- 
ized by the function 


T (q, 0) = \ e- jot dt (A* (q, 0) A (qa t)), (9.89) 
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where 


A(q, t) => W ; (q) exp [iq (R; (t)], (9.90) 


Aw = Ek — Eg, and W; (q) is the pseudopotential formfactor for 
the ion located at R; 
For a pure metal 


r (q, ©) = |W (q)/? S (q, ), (9.91) 


where S (q, œ) is the dynamic structure factor of van Hove [543]. 
Electrical resistivity is given by 


1 
12 ¥)2 
p= | Tav (0 wan, (9.92) 


0 
where B=fi/Kpf, and Zay (q) is 


ra= | qos PG 0) do (9.93) 


averaged over all directions except the one corresponding to Bragg 
scattering. 
For a binary alloy, we have 


Tía, 0)= |W]? Syy (q, ©) + |W, — W2!” Scc (q, ©) 
+20 (W, — W3) Sye(a, ©) (9-94) 
instead of (9.91). Here 





Suy (4, ©) =z | 67i di (N* (a, 0) N (a, t), 
Scc (A, ai | e-fetae (C* (q, 0)C (q, D), (9.95) 
Sold, ©) =e | 07®tdt(N* (q, 0)C(q, t)+C* (a, 0) N (q, t), 


where N* (q, t) and C* (q, t) are the Fourier transforms of density 
fluctuations and composition fluctuations, respectively. The func- 
tions Syy (q, ©), Sce (q, œ), and S yc (q, œ) are the dynamic struc- 
ture factors of the alloy. The first corresponds to fluctuations of the 
average density as a whole, the second corresponds to the composition 
fluctuations, and the third function is the interference of the first two. 
Clearly S yy (q, œ) is an analog of the dynamic structure factor of 
a pure metal. - 
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By writing 
Syn (q) = f [Bw/(e8® — 1)] Syy (q, ©) do, 


Sco (4) = | IBo/(e® —1)1 Sco (q, ©) do, (9.96) 
2Sxc (q) = | [Bo/(e8® — 1)] 28y¢(q, ©) do, 
we get 
F(a) = IP Syy (q) + W, — Wl? Scc (0) +2W (Wi — W3) So (4), 


(9.97) 


which is the differential intensity of scattering from a state k’ to 
a state k =k’ +q. At high temperatures iw/kg T< 1, so the 
factor Bw/(e® — 1) may be replaced by unity, giving 


Syy (q) = Lt (N* (q) N (qa), 
Sce (q)= L (C* (q)C(q)), (9.98) 
Syce (q) = Pe (N* (q) C (q)). 


The first of these expressions is the high-temperature structure 
factor for a pure metal or for an alloy; in the latter case Syy (q) 
is an average over all the species of an atom, viz., 


Syni = CD D e's”) — L8q,0. (9.99) 


The other two expressions in (9.98) are only meaningful for alloys. 
Bhatia and Thornton [542] also found that the structure factors 
we are discussing are related to the atom-atom structure factors 


Aas = 1+ | [eae — 1] #2 SR anr® dr, (9.100) 
0 


where Qag (r) is the probability of finding an @ ion within a unit 
volume already occupied by a ß ion at a distance r from the origin. 
The probability is normalized to unity at large r. According to [542], 


Syy (9) = C7444 T Câ + 261024142, 
Syo (9) = lil [C4 (a11 — a22) — C2 (a22 — 242)), (9.101) 
Sce (4) = eyez [1 -+ C4C (a11 + a22 — 2412)] 

which shows that if the partial structure factors are equal, then 


Syy (0) =a (4), Suc (q) =0, and Seg (9) = C4162, (9.102) 
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which corresponds to the perfect solution case. It was found that 
in both the Percus-Yevick and the hard-sphere models the function 
Scc (q) slightly oscillates about cyc,, S yc (q) oscillates about zero, 
and Syw (q) is represented by a sequence of bell-shaped functions 
which get lower and wider with g. The “bells” are not entirely sym- 
metric. 

The structure factors like S yc arerelated to the short-range order 
parameters of a noncrystalline alloy, which characterize the arrange- 
ment of (both like and unlike) atoms in the alloy. This can be 
seen from the Fourier transforms of the structure factors, 


2 . 

Gun (T) = pg \ g [Syn (g)— ij sin gr dq, 
2 , 

Gwo (r)=Ż | gSwc (g) sin qr da, (9.103) 
2 ; . 

Gec(r) = j g iS ce (9) — €1¢2]/e,co} sin gr dq, 


which are in turn related to the correlation probability functions 
Ewn (r), Enc (r), and Sec (r) for the density and composition fluctu- 
ations and to the densities py vn (r), Pnc (r), and pcc (r) of these 
fluctuations. We have 


Gyn (T) = 40r [Pyy (T)— Pol, 
Gye (r) = 4arpye (r), (9.104) 
Gco (T) =4nrpce (r). 


The corresponding coordination numbers are obtained by integrating 
the fluctuation density functions p y vn (r), One (r), and pcc (r) over 
the “bells”, viz., 


Tmin 
a 


Zyn = | ánr?pyy (r) dr, 
0 


min 
2nc= f ánr?poyc (r) dr, (9.105) 


m 
Bcc = | á4nr?poce (r) dr. 
0 


Clearly, 
o: (r) = > pi (r) (9.106) 
J 
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is the total number of (any kind of) atoms at a distance r from an 
ith atom. The corresponding coordination numbers are 


Z4 = Zyn + 2nclCy= 2444+ 242, 





Z2 = Zyn — ZyclCg = 204 + Z22, (9-107) 
giving, by (9.104), 
Zyn = C121 F C2292, (9.108): 
Zye = C4C2 (24 — 22), (9.109): 
Zoo = C2214 + 6422-22 (9.110): 


C2 


where Zyn and Zcc are the coordination numbers for topological 
and compositional short-range order respectively. Equation (9.110) 
may be rewritten as 


Zee = (C224 + C422) {1 — EAEAN = 2yhy (9.111): 








with 
Zw = C224 T C422 = Zyn 1 aa Zye (9.142) 
and 
Z 
a,=1—-—", 9.113) 
w Toc, ( ) 


the latter expression also gives the (compositional) short-range or- 
der parameter in disordered crystalline alloys [411, 412]. 
If zyc=0 or cy=Cg, then 2,,=2yy and 


Tmin 


a, =1— 2 | 4ar?pco(r) dr. (9.114) 


Co2NN ZNN 





0 


Katsnelson and Safronova [536] used the same approximation to 
derive a similar expression for binary noncrystalline alloys by con- 
sidering correlations in arrangements of unlike atoms. The same 
arguments were used by Katsnelson et al. [550] in pseudopotential 
calculations of the energy of such alloys and in finding the contribu- 
tions to the energy from topological and compositional short-range 
order. 

We may thus conclude that the electrical resistivity and other 
transport properties of noncrystalline alloys is determined by the 
temperature, composition, and interatomic interactions. If there 
is no dominant component in an alloy, it is sensible to use an average 
structure factor like S yy as a first approximation and then to cor- 
rect the result using Sec or S ye-type terms. Although the second 
step in this program is rarely possible because of the lack of relevant 
experimental data, encouraging results have been obtained. The 
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Fig. 9.2. Electrical resistivity of Cu-Au alloys as a function of concentration. 
Empty circles—theory, crosses—experiment. From Dreirach et al. [539]. 
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Fig. 9.3. Electrical resistivity of Ag-Sn alloys as a function of concentration. 
Empty circles—theory, crosses—experiment. From Dreirach et al. [539]. 


concentration dependence of resistivity has been explained for 
alloys containing elements from different regions of the periodic 
table. The concentration dependence of 0p/0T has been interpreted. 
A relationship had been found between electrical resistivity and the 
sign of its temperature coefficient, and finally, the relationship be- 
tween thermo-emf and anomalies in 00/07 has been explained. 
Dreirach et al. [539] calculated the electrical resistivity for the 
liquid alloys Cu-Au, Cu-Sn, Cu-Ge, Ag-Sn, Au-Ni, Ni-Sn, Fe-Ge 
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and some others. The most interesting data were obtained for noble 
metal-noble metal alloys (Cu-Au, for example) and for alloys of 
noble (or transition) metals and polyvalent metals (Cu-Sn, Ag-Sn, 
Cu-Ge, Fe-Ge). For the first type of alloy no region of negative 
60/0T was found (Fig. 9.2), implying that 2k, lies in the region 
q < max: max being the position of the main structure factor. The 
computed concentration dependence of resistivity agreed with the 
experimental data. For the second alloy type (Fig. 9.3), negative 
ĉp/ðT (and hence thermo-emf anomalies) were found near the maxima 
of the concentration dependence of p and corresponded to intermedi- 
ate valencies (between 2 and 3). It is noteworthy that the region of 
60/0T <0 did not coincide with the position of the resistivity 
maximum op. This means that the temperature dependence of p 
is not only determined by the maximum of the structure factor but 
also by its wings, where composition short-range order may be 
important. 

Similar results have been found for amorphous alloys (metallic 
glasses). In the Ni-P system, for example, 09/0T is positive at low 
content of P (15-20%) and becomes slightly negative at 24-26%, 
implying a temperature dependence of the structure factor upon the 
relative positions of 2ky and qmax [549]. Metal-metal amorphous 
alloys display similar behavior [545, 552] as exemplified by the 
systems Zr,,Be,,, ZrgjBe,,, Zr7,;Rh,,, and Zr;,Cu,,, in which the 
theoretical dependence S (T) agrees well with the experimental 
curve p (t) [552]. Some of the discrepancies could be easily removed 
by optimizing the calculation parameters and by including more 
structure details in the calculation. The agreement is more remar- 
kable considering that the low-temperature resistivity ranges from 70 
to 300 pohm-cm in these alloys. Although these are rather encourag- 
ing results, the quantitative description is not as yet entirely satis- 
factory, especially for very resistive alloys of about 200 pohm-cm 
[545]. There are three main reasons for this. 

First, a calculation of resistivity not only requires an accurate 
knowledge of the g dependence of the structure factor but also of the 
contributions to the structure factor due to pairs of unlike atoms or 
fluctuations of various types. This necessitates the same treatment 
background for samples in which transport properties are studied 
and for those from which structure data are derived. There are no 
studies, as far as we know, that fulfil this condition. Since the diffu- 
sion halo of the structure factor is narrower in amorphous systems 
than in liquid systems, calculations for the former necessarily in- 
volve more errors and agree worse with experiment [5414]. 

Second, the choice of the shape and parameters of the pseudopoten- 
tial, the screening functions, and other relevant characteristics re- 
mains arbitrary. For example, in diluted crystalline Li-Na alloys 
the residual resistivity can be changed two- or three-fold by varying 
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the core radius r,, exchange-correlation corrections, and the shape 
of screening function [553]. Indeed, different model potentials are 
needed for different alloys to fit the experimental data, however 
similar the electron structures of the alloys might be. Although the 
use of 7 matrix may improve the agreement with ex periment [503, 528], 
the results remain sensitive to the details of the calculation scheme. 

Finally, the fundamental approximations involved in resistivity 
calculations may be blamed for discrepancies, the concept of the 
mean free path being perhaps the most questionable. At resistivi- 
ties of the order of 150-200 wohm-cm the mean free path of the elec- 
trons is comparable with the interatomic separation and the nearly- 
free-electron model becomes inadequate. A more sophisticated the- 
ory of transport properties will perhaps remove this difficulty (see, 
for example, [530-533]). We may be certain, however, that the 
pseudopotential approach to transport properties will retain its 
value as a helpful first approximation. 
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